o 
t-« 
a. 



UTILITY 
^>?TENT APPLICATION 
TRANSMITTAL 

(Only for /?ew nonprowsional applications under 37 CFR 1.53(b)) 


Attorney Docket No. 


0660-0155-0 DIV ™*£ S 


First Inventor or Application Identifier 


Michel ARTHUR 


Title 


inC— J _ j. 

POLYPEPTIDES IMPLICATED IN THE EXPRESSION OF RESISTANCE TOB 
GLYCOPEPTIDES IN PARTICULAR IN GRAM-POSITIVE BACTERIA," 1 " 1 ^ 
NUCLEOTIDE SEQUENCE CODING FOR THESE POLYPEPTIDES AND I 
USE FOR DIAGNOSIS I 



APPLICATION ELEMENTS 

See MPEP chapter 600 concerning utility patent application contents 



Assistant Commissioner for Patents 
ADDRESS TO: Box Patent Application 
Washington, DC 20231 



Fee Transmittal Form (e.g. PTO/SB/17) 

(Submit an original and a duplicate for fee processing) 



2. ■ Specification 



Total pp. 76 



Drawing(s) (35 U.S. C. 113) Total Sheets 



69 



4. 



I Oath or Declaration Total Pages 

a. □ Newly executed (original or copy) 

b ■ Copy from a prior application (37 C.F 
— (for continuation/divisional with box 75 comph 

□ DELETION OF INVENTOR(S) 

Signed r*~* * "** — ' — ' — : - 

in The pr 
1.33(b). 



6. 

7. 

8. 

9. 
10. 
11. 



□ 
D 
□ 
□ 
□ 



37 C.F.R. §3.73(b) Statement 
(when there is an assignee) 



I. 



Signed statement attached deleting inventorfs) named 
in the prior application, see 37 C.F.R. §1. 63(d)(2) and 



12. □ 

13. □ 

14. ■ 



ACCOMPANYING APPLICATION PARTS 

Assignment Papers (cover sheet & documents)) 

o Power of Attorney 

English Translation Document (if applicable) 

Information Disclosure 
Statement (IDS)/PTO-1449 

Preliminary Amendment 

White Advance Serial No. Postcard 

Small Fntitv Statement filed in prior 

it£m% D SRS'SSSfe status stil ' proper 

Certified Copy of Priority Document(s) 
(if foreign prionty is claimed) 



Copies of IDS 
Citations 



Other: 



Request for Priority 



* 5. 



Incorporation By Reference (usable ifbox4B is checked) 
The entire disclosure of the prior application, from which a copy of 
the oath or declaration is supplied under Box 4B, is considered to be 
part of the disclosure of the accompanying application and is hereby 
incorporated by reference therein. 



ft 5. If a CONTINUING APPLICATION, check appropriate box, and supply the requisite information below: 

□ Continuation ■ Divisional o Continuation-in-part (CIP) of prior application no.: 08/980,357 

J Prior application information: Examiner: HORLICK Group Art Unit: 1634 



16. Amend the specification by inserting before the first line the sentence 
■ This application is a □ Continuation ■ Divisional 
of application Serial No. 08/980,357 Filed on 



□ Continuation-in-part (CIP) 

November 28, 1997, now pending, which is a Divisional of U.S. 
Application Serial No. 08/286,819, now U.S. Patent No. 5,871,910, 
which is a Continuation of U.S. Application Serial No. 08/174,682, 
now abandoned, which is a Continuation of U.S. Application Serial 
No. 07/917,146, now abandoned, which was filed as International 
Application Serial No. PCT/FR9 1/00855, filed October 29, 1991. 



o This application claims priority of provisional application Serial No. 



Filed 



17. CORRESPONDENCE ADDRESS 

OBLON, SPIVAK, MCCLELLAND, MAIER & NEUSTADT, P.C. 
FOURTH FLOOR 
1755 JEFFERSON DAVIS HIGHWAY 
ARLINGTON, VIRGINIA 22202 
(703J 413-3000 
FACSIMILE: (703)413-2220 



Name: 


Norman F. Obion 


Registration No.: 


24,618 


Signature: 




Date: 




Name: 


James 1. Kelly, F^h.D. 


Registration No.: 


41,504 



1 



Polypeptides implicated in the expression of resistance to 
glycopeptides* in particular in Gram-positive bacteria. Nucleotide 

sequence coding for these polypeptides and use for diagnosis 

-v 

The invention relates to the polypeptides associated with 
the expression of resistance to antibiotics of the glycopeptide family, 
in particular in Gram-positive bacteria, in particular in the family 
of the Gram-positive cocci* The invention also relates to a nucleotide 
sequence coding for these polypeptides* It also relates to the use 
of these polypeptides and their nucleotide sequence as agents for the 
in vitro detection of resistance to glycopeptides. Among the 
Gram-positive cocci, the invention relates most particularly to the 
enterococci, the streptococci and the staphylococci which are of 
particular importance for the implementation of the invention* 

The glycopeptides, which include vancomycin and teicoplanin 
are antibiotics which inhibit the synthesis of the bacterial cell wall. 
These antibiotics are very much used for the treatment of severe 
infections due to Gram-positive cocci (enterococci, streptococci and 
staphylococci), in particular in the of allergy and resistance 

to the penicillins. In spite of long clinical usage of vancomycin, 
this antibiotic has remained active towards almost all of the strains 
up to 1986, the date at which the first resistant strains were isolated. 
Since then, resistance to the glycopeptides has been detected by many 
microbiologists in Europe and in the United States, in particular in 
strains isolated from immunodepressive patients, making necessary a 
systematic evaluation of the sensitivity of the microbes in hospital 
environments . 

The activity of the glycopeptides depends on the formation 
of a complex between the antibiotic and the precursors of the 
peptidoglycan, more than on the direct interaction with enzymes of 
cell wall metabolism. In particular, it has been observed that the 
glycopeptides bind .to the terminal D-alanyl-D-alanine residues (D-ala- 
D-ala) of the precursors of the peptidoglycan* 

The recent emergence of resistance to the glycopeptides, 
in particular in the enterococci, has led to certain results being 
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obtained with regard to knowledge of the factors conferring this 
resistance. 

For example it has been observed in a particular strain of 
enterococci, Enterococcus faecium BM4147, that the determinant of 
5 resistance to the glycopeptides is localized on a plasmid of 34 kb, 

the plasmid pIP816. This determinant has been cloned in E.coli (Brisson 
Noel et al., 1990, Antimicrob Agents Chemother 34, 924-927). 

According to the results hitherto obtained, the resistance 
to the glycopeptides is associated with the production of a protein 

10 of molecular weight of about 40 kDa, the synthesis of this protein 

being induced by sub-inhibitory concentrations of certain glycopeptides 
such as vancomycin. 

By carrying out a more detailed study of the resistance of 
certain strains of Gram-positive cocci towards glycopeptides, in 

15 particular vancomycin or teicoplanin, the inventors have observed that 

this resistance might be linked to the expression of several proteins 
or polypeptides encoded in sequences usually borne by plasmids in the 
resistant strains. The recent results obtained by the inventors also 
make it possible to distinguish the genes coding for two phenotypes 

20 0 f resistance, on the one hand strains highly resistant to the 

glycopeptides, and, on the other, strains with a low level of 
resistance. 

By strain with a high level of resistance is meant a strain 
of bacteria, in particular a strain of Gram-positive cocci, for which 
25 the minimal inhibitory concentrations (MIC) of vancomycin and 

teichoplaninare higher than 32 and 8 ug/ml, respectively. The MIC of 
vancomycin towards strains with low-level resistance are included 
between 16 and 32 ug/ml. These strains are apparently sensitive to 
teicoplanin. 

30 The inventors have isolated and purified, among the components 

necessary for the expression of the resistance to the glycopeptides, 
a particular protein designated VANA or VanA which exhibits a certain 
homology with D-alanine-D-alanine ligases. VanA is nonetheless 
functionally distinct from the ligases. 

35 In principle, a gene sequence will be designated by "van..." 
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and an amino acid sequence by "Van*.." 

The invention relates to polypeptides or proteins implicated 
in the expression of resistance to antibiotics of the glycopeptide 
family and, in particular, to vancomycin and/or teicoplanin as we u 
as to the nucleotide sequences coding for such complexes* 

The invention also relates to nucleotide probes which can 
be used for the detection of resistance to the glycopeptides, in 
particular by means of the polymerase chain reaction (PCR), or by tests 
involving antibodies. 

The invention relates to a composition of polypeptides, 
characterized in that it contains at least one protein or part of a 
protein selected from the amino acid sequences identified in the list 
of the sequences as SED ID NO 1 (VanH), SEQ ID NO 2 (VanA), SEQ ID 
NO 3 (VanX) or SEQ ID NO 19 (VanC), or any protein or part of a protein 
15 recognized by the antibodies directed against VanH, VanA, VanX or VanC, 

or any protein or part of a protein encoded in a sequence hybridizing 
with one of the nucleotide sequences identified in the list of the 
sequences as SEQ ID NO 8, SEQ ID NO 9, SEQ ID NO 10 or SEQ ID NO 21 
or with one of the following sequences VI or V2 under stringent or 
20 only slightly stringent conditions: 

VI ; GGX GAA GAT GGX TCX TTX CAA GGX 
G C AG C G 
A 

25 V2 : AAT ACX ATX CCX GGX TTT AC 

C T C 
C 

A first particular composition according to the invention 
implicated in the expression of the resistance to the glycopeptides 

30 is characterized in that it comprises at least 3 proteins or any part 

of one or more of these proteins necessary to confer to Gram-positive 
bacteria the resistance to antibiotics of the glycopeptide family, 
in particular to vancomycin and/or teicoplanin or to promote this 
resistance, in particular in strains of the family of the Gram-positive 

35 cocci, these proteins or parts of proteins being 



a) recognized by antibodies directed against one of the sequences 
identified in the list of the sequences as SEQ ID NO 1, SEQ 
ID NO 2, SEQ ID NO 3, 

b) or encoded in genes containing a sequence identified as SEQ ID NO 
8, SEQ ID NO 9 or SEQ ID NO 10 or hybridizing with one of these 
sequences or its complementary sequence or with the sequences VI 
or V2, under stringent or only slightly stringent conditions. 

These sequences are also designated, respectively, by 0RF3, 
0RF1 containing the gene VanH, vanA (or 0RF2); they characterize the 
proteins responsible for resistance as obtained from the strain 
Enterococcus faecium BMA147 described by Leclerq et al (N. Engl. J, 
Med, 319:157-161). 

Another protein, VanC, related to the D-Ala-D-Ala ligases 
but of different specificity has been characterized in Enterococcus 
gallinarum BM4173; the vanC gene possesses domains having sufficient 
homology with the vanA gene for probes corresponding to defined regions 
of vanA to make possible its detection. 

E. gallinarum is a constitutive isolate resistant to low levels 
of vancomycin (Dutka-Malen et al., Antimicrob. Agents Chemother 34 
(1990b) 1875-1879). 

By the expression "polypeptides" is meant any sequence of 
amino acids constituting proteins or being of a size less than that 
of a protein. 

The stringent conditions mentioned above are defined according 
to the usual conditions pertaining to the hybridization of nucleotide 
sequences. As an example, in the case of the sequences which hybridize 
with the sequence of the vanA gene (SEQ ID NO 8) it will be possible 
to apply the following conditions: 

- for hybridization under conditions of high stringency: 

* a reaction temperature of 65°C overnight in a solution containing 
0.1% SDS, 0.7% skimmed milk powder, 6xSSC (IxSSC = 0.15 M NaCl and 
0.015 M sodium citrate at pH = 7.0) 

* washes at 65°C in 2xSSC - 0.1% SDS; 

- for hybridization under slightly stringent conditions, the 
hybridization temperature is 60°C overnight and the temperature 
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of the washings is 45°C. 

The expression of resistance to glycopeptides may be expressed 
by the persistence of an infection due to microbes usually sensitive 
to the glycopeptides. 

A polypeptide or a protein is necessary for the expression 
of resistance to the glycopeptides, if its absence makes the strain 
which contains this polypeptide or this protein more sensitive to the 
glycopeptides and if this polypeptide or protein is not present in 
sensitive strains. 

Different levels of resistance to the glycopeptides exist 
in the strains of Gram-positive cocci, in particular. 

According to a preferred embodiment of the invention, the 
polypeptides included in the composition defined above correspond to 
the combination of the proteins identified in the list of the sequences 
15 as SEQ ID NO 1 (VanH), SEQ ID NO 2 (VanA), SEQ ID NO 3 (VanX). 

The inventors have thus observed that the expression of 
resistance to the glycopeptides in Gram-positive bacteria requires 
the expression of at least three proteins or of polypeptides derived 
from these proteins. 

20 According to a first particular embodiment of the invention, 

the polypeptides of the composition are also characterized in that 
the amino acid sequences necessary for the expression of resistance 
to antibiotics of the glycopeptide family are under the control of 
regulatory elements, in particular of the proteins corresponding to 

25 the sequences designated by SEQ ID NO 4 and SEQ ID NO 5 in the list 

of the sequences, and which correspond to a regulatory sequence R and 
to a sensor sequence S, respectively. 

VanS and VanR constitute a two-component regulatory system, 
VanR being an activator of transcription and VanS stimulating the 

30 transcription dependent on VanR. VanS is capable of modulating the 

level of phosphorylation of VanR in response to the vancomycin present 
in the external medium and is thus involved in the control of the 
transcription of the genes for resistance to vancomycin. 

These regulatory sequences are in particular capable of 

35 increasing the level of resistance, to the extent to which they promote 



6 



the expression of the proteins responsible for resistance comprised 
in the polypeptides of the invention. 

According to another advantageous embodiment of the invention, 
the polypeptides of the above composition are encoded in the sequence 
5 SEQ ID NO 6 identified in the list of the sequences, which represents 

the sequence coding for the 5 proteins previously described. 

Another sequence according to the invention is designated 
by SEQ ID NO 11 which contains the sequence SEQ ID NO 6 as well as 
a sequence upstream from SEQ ID NO 6 coding for a transposase (encoded 

10 in the (-) strand of the sequence, and a sequence downstream from SEQ 

ID NO 6 corresponding to the genes vanY and vanZ and at each end reverse 
repeated sequences of 38 bp. SEQ ID NO 11 constitutes a transposon, 
the genes of which are implicated at different levels in the 
establishment of resistance to the glycopeptides. 

15 The invention also relates to the purified proteins belonging 

to the composition and to the polypeptides described previously. In 
particular, the invention relates to the purified protein VanA, 
characterized in that it corresponds to the amino acid sequence SEQ 
ID NO 2 in the list of the sequences or a protein VanC, encoded in 

20 a gene capable of hybridizing with the vanA gene. 

The protein VanA contains 343 amino acids and has a calculated 
molecular mass of 37400 Da. The protein VanC contains 343 amino acids 
and has a calculated molecular mass of 37504 Da. 

Other interesting proteins in the framework of the invention 

25 correspond to the sequences identified as SEQ ID NO 1 (VanH), SEQ ID 

NO 3 (VanX), SEQ ID NO 4 (VanR), SEQ ID NO 5 (VanS) in the list of 
the sequences. 

The sequence identified by the abbreviation SEQ ID NO 1 
contains the protein VanH encoded in the gene vanH, this protein 

30 contains 322 amino acids and begins with a methionine. This protein 

is an enzyme implicated in the synthesis of the peptidoglycan and has 
a molecular mass of 35,754 kDA. VanH exhibits some similarities to 
dehydrogenases which catalyze the NAD"*~-dependent oxidation of 2-hydroxy- 
car boxy lie acids to form the corresponding 2-keto-carboxylic acids. 

35 In fact, the VanH protein might use NADP + rather than NAD + . The VanH 
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protein also contains several residues of reactive sites which probably 
participate directly in the binding of the substrate and in catalysis. 
VanH might be implicated in the synthesis of a substrate of the ligase 
VanA. This substrate of VanA might be a D-ofc-hydroxy-carboxylic acid, 
5 which might be condensed by VanA with D-alanine in the place of a D- 

amino acid, which might affect the binding of the precursor of the 
peptidoglycan with vancomycin, as a result of the loss of a hydrogen 
bond because one of the hydrogen bonds formed between vancomycin and 
N-acetyl-D-Ala-D-Ala occurs with the NH group of the terminal D-alanine 
10 residue* Let it be recalled that "Ala ,! is the abbreviation for 

"alanine". 

The inventors have been able to detect some interactions 
between the proteins VanA and VanH and have in particular been able 
to describe the following : the nature of the VanA protein (D-alanine: 

15 D-alanine ligase with reduced specificity for its substrate) which 

has made possible resistance to glycopeptides, implies the biosynthesis 
by VanA of a novel compound different from D-Ala-D-Ala, a peptide which 
may be incorporated into the peptidoglycans but which is not recognized 
by vancomycin* In particular, the observation of similarities between 

20 the product of the vanH gene and the D-specific &-keto-acid reductases 

has made it possible to determine that this compound cannot be a D- 
amino acid but is a D* hydroxy acid, which when it is bound to D-alanine 
by VanH, can generate the novel depsipeptide precursor of the 
peptidoglycan. 

25 The invention also relates to any combination of these 

different proteins in a resistance complex, as well as to hybrid 
proteins comprising one or several of the above proteins, or part of 
these proteins, in combination with a defined amino acid sequence. 

Also included in the framework of the invention are nucleotide 

30 sequences coding for one of the amino acid sequences described above. 

A particular sequence is the nucleotide sequence of about 
7.3 kb, corresponding to the Hind lll -EcoR I restriction fragment, such 
as that obtained starting from the plasmid pIP816 described in the 
publication of Leclerq et al - 1988, cited above. 

35 This sequence of 7.3 kb comprises the nucleotide sequence 
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coding for the 3 resistance proteins and the 2 regulatory proteins 
referred to above* This coding sequence is included in an internal 
Bglll-Xbal fragment. It also comprises a part of the sequences coding 
for the transposase and the resolvase. 
5 The invention also relates to any nucleotide fragment 

comprising the above-mentioned restriction fragment as well as any 
part of the Hindlll-EcoRI fragment, in particular the EcoRI-Xbal 
fragment of about 3.4 kb coding for the 3 resistance proteins or the 
EcoRV-SacII fragment of about 1.7 kb coding for VanA or also Hindlll- 
10 EcoR I fragment of about 3.3 kb coding for the 2 regulatory proteins 

VanR and VanS. 

Another definition of a nucleotide sequence of the invention 
corresponds to a nucleotide fragment containing the following 
restriction sites in the following order, such as obtained starting 

15 from pIP816 mentioned above: 

Hindlll, Bglll, Bglll, EcoRI, BamHI, Xbal , EcoRI. 

Another nucleotide sequence according to the invention is 
characterized in that it corresponds to a sequence selected from the 
sequences identified as SEQ ID NO 7, SEQ ID NO 6, SEQ ID NO 11 or SEQ 

20 ID NO 22 , or in that it includes this sequence or any part of this 

sequence, or also any sequence or part of the sequence of the 
complementary DNA or any sequence of RNA corresponding to one of these 
DNAs , capable , 

- either of constituting a hybridization probe for the detection of 
25 resistance to antibiotics of the glycopeptide family, in particular 

to vancomycin and/or teicoplanin in particular in strains of the family 
of the Gram-positive cocci, 

- or of coding for a sequence necessary or associated with the 
expression of resistance to antibiotics of the glycopeptide family, 

30 in particular to vancomycin and/or teicoplanin., in particular in strains 

of the family of the Gram-positive cocci. 

The sequence SEQ ID NO 7 codes for the 3 resistance proteins 
VanH, VanA and VanX. 

The sequence SEQ ID NO 22 and the sequence SEQ ID NO 11 
35 include a transposon shown in Figure 7a; this transposon contains the 



genes necessary for the expression of resistance to the glycopeptides 
as well as the genes associated with this resistance implicated, for 
example, in the regulation of the expression of the genes necessary 
to produce the resistance phenotype or implicated in the amount of 
resistance polypeptide produced. 

A specific sequence corresponding to the above definition 
is one of the following sequences: 
VI : GGX GAA GAT GGX TCX TTX CAA GGX 

G C AG C G 
or A 
V2 : AAT ACX ATX CCX GGX TTT AC 
C T T 

C 

VI and V2 make possible the constitution of probes, if 
necessary, in combination with other nucleotides, depending on the 
degree of specificity desired in order to detect vanA and vanC and 
may also be used as primers in polymerase chain reactions. 

Other preferred nucleotide sequences are the sequences SEQ 
ID NO 8, SEQ ID NO 9, SEQ ID NO 10, SEQ ID NO 21, SEQ ID NO 12 
(transposase), SEQ ID NO 13 (resolvase), SEQ ID NO 14 (vanY), SEQ ID 
NO 15 (vanZ), SEQ ID NO 23 (vanR), SEQ ID NO 24 (vanS) or a variant 
of one of these sequences provided that it codes for a protein having 
immunological and/or functional properties similar to those of the 
proteins encoded in the sequences SEQ ID NO 8 (vanA), SEQ ID NO 9 
(vanH), SEQ ID NO 10 (vanX), or SEQ ID NO 21 (vanC), SEQ ID NO 12 
(transposase), SEQ ID NO 13 (resolvase), SEQ ID NO 14 (vanY), SEQ ID 
NO 15 (vanZ), SEQ ID NO 23 (vanR), SEQ ID NO 24 (vanS) or in that it 
makes possible the detection of strains resistant to antibiotics of 
the glycopeptide family. 

Variants include all of the fragments of the sequences having 
the following properties. 

These sequences code for the resistance proteins VanH, VanA 

and VanX. 

The nucleotide sequence designated by SEQ ID NO 8 corresponds 
to a DNA fragment of 1029 bp situated between the ATG codon at position 
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377 and the TGA codon at position 1406 on the plasmid pAT214 (Fig, 
6). 

The invention also relates to a nucleotide sequence coding 
for the sequence SEQ ID NO 6 corresponding to the sequence coding for 
the 5 proteins (2 regulatory proteins and 3 resistance proteins), and 
also comprising the flanking sequences associated with these coding 
sequences, or comprising this sequence. 

Also included in the framework of the invention is a sequence 
modified with respect to SEQ ID NO 6, characterized in that it lacks 
the flanking sequences. These flanking sequences are the sequences 
shown in the following pages and defined as follows: 

- sequence upstream from the sequence coding for R: between the bases 
1 and 1476 of the sequence shown in Figure 5, 

- sequence between the sequence coding for the sensor protein S and 
0RF1: between the bases 3347 and 3500 of the sequence shown in Figure 
5, 

- sequence downstream from the sequence coding for 0RF3: between the 
bases 6168 and 7227 of the sequence shown in Figure 5. 

The sequence designated by SEQ ID NO 6 is also characterized 
by the fragment bearing the restriction sites in the following order: 
BgllH - EcoRI - BamHI - EcoRI 

The location of the regulatory proteins and the resistance 
proteins is shown in Figure 3. 

The inventors have identified upstream and downstream from the 
genes vanR, vanS, vanH, vanA and vanX, which are necessary for or 
associated with the expression of resistance to glycopeptides at a 
given level, genes coding for a transposase and a resolvase (upstream 
from the group previously mentioned) and genes vanY and vanZ, downstream 
from this group. The genes for the transposase and resolvase might 
be implicated in transposition functions and the vanY gene coding for 
a D,D-carboxy peptidase might be implicated in the metabolism of the 
peptidoglycan, and might contribute to resistance to the glycopeptides 
i n E. faecium BM4147 even though vanR, vanS, vanH, vanA and vanX borne 
by a plasmid in a high number of copies, alone confer a high level 
of resistance. 
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Let it be noted that the sequence coding for the transposase 
is located on the (-) strand of the sequence ID NO 22 which codes for 
vanR, vanS, vanH, vanA, vanX, vanY, vanZ and the resolvase. 

The invention relates not only to the DNA sequences identified 
5 in the list of the sequences but also to the complementary DNA sequences 

and the corresponding RNA sequences. The invention concerns in addition 
sequences which are equivalent to the former, either in terms of 
expression of proteins, polypeptides or their fragments described above, 
or in terms of the capacity to detect, for example by chain 
10 polymerization procedures, strains of Gram-positive bacteria exhibiting 

resistance to antibiotics of the glycopeptide family such as vancomycin 
or teicoplanin. 

Recombinant sequences characterized in that they comprise 
one of the above nucleotide sequences also form part of the invention* 
15 The invention also relates to a recombinant vector 

characterized in that it includes one of the above nucleotide sequences 
at a site inessential for its replication, under the control of 
regulatory elements likely to be implemented in the expression of the 
resistance to antibiotics of the glycopeptide family, in particular 
20 to vancomycin or teicoplanin, in a defined host. 

Particularly advantageous recombinant vectors for the 
implementation of the invention are the following vectors: pAT214 
containing the EcoR V -SacI I fragment of 1761 bp containing a nucleotide 
sequence coding for the VanA protein; in these vectors the sequences 
25 of the invention are advantageously placed under the control of 

promoters such as the lac promoter. 

The invention also relates to a recombinant cell host 
containing a nucleotide sequence such as that previously described 
or a vector such as that described above under conditions which make 
30 possible the expression of resistance to antibiotics of the glycopeptide 

family, in particular resistance to vancomycin and/ or 
this host being for example selected from the bacteria, in particular 
the Gram-positive cocci. 

In certain applications it is also possible to use yeasts, 
35 fungi, insect or mammalian cells. 



12 



The invention also relates to a nucleotide probe characterized 
in that it is capable of hybridizing with a sequence previously 
described, this probe being labelled if necessary. These probes may 
or may not be specific for the proteins of resistance to glycopeptides. 

Labels which can be used for the requirements of the invention 
are the known radioactive labels as well as other labels such as 
enzymatic labels or chemoluminescent labels. 

Probes thus labelled may be used in hybridization tests in 
order to detect resistance to glycopep tides in Gram-positive bacteria. 
In this case, conditions of low stringency will be used. 

Nucleotide probes according to the invention may be 
characterized in that they are specific in Gram-positive bacteria for 
the sequences coding for a resistance protein to the glycopep-fcides, in 
particular to vancomycin and/or teicoplanin these probes being in 
addition universal among these sequences. 

By these specific probes is meant any oligonucleotide 
hybridizing with a nucleotide sequence coding for one of the proteins 
according to the invention, such as described in the preceding pages, 
and not exhibiting a cross hybridization reaction or amplification 
reaction (PCR) with sequences present in all of the sensitive strains. 

The universal character of the oligonucleotide which can 
be used in PCR is defined by their capacity to promote specifically 
the amplification of a nucleotide sequence implicated in resistance 
in any one strain of Gram-positive bacteria, resistant to the 
antibiotics of the glycopeptide family. 

The size of the nucleotide probes according to the invention 
may vary depending on the use desired. For the oligonucleotides which 
are used in PCR, recourse will be had to fragments of a length which 
is usual in this procedure. In order to construct probes, it is possible 
to take any part of the sequences of the invention, for example probe 
fragments of 200 nucleotides. 

According to a particular embodiment of the invention, a 
nucleotide probe is selected for its specificity towards a nucleotide 
sequence coding for a protein necessary for the expression in Gram- 
positive bacteria of a high level of resistance to antibiotics of the 
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glycopeptide family, in particular to vancomycin and teicoplanin. 

As examples, useful probes may be selected from the intragenic 
part of the vanA gene. 

Other useful probes for carrying out the invention are 
5 characterized by their universal character, according to the preceding 

definition, but are not specific for the resistance genes . They may 
also be used as primers in PGR, and are for example: 
VI : GGX GAA GAT GGX TCX TTX CAA GGX 
G C AG C G 
10 A 

V2 : AAT ACX ATX CCX GGX TTT AC 
C T C 
C 

VI and V2 hybridize with vanA and vanC and are capable of 
15 leading to the detection of proteins associated with resistance to 

glycopeptides in other micro-organisms. 

Other particular probes of the invention have the specific 
character of a nucleotide sequence coding for a protein necessary for 
the expression in Gram-positive bacteria of a low level of resistance 
2 ^ to antibiotics of the glycopeptide family, in particular to vancomycin 

in Gram-positive bacteria. 

It should also be mentioned that oligonucleotide probes which 
might be derived from the sequence of the vanA gene coding for the 
VanA protein may be used indiscriminantly to detect high-level or low- 
25 level resistance. 

In a particularly preferred manner, a probe of the invention 
is characterized in that it hybridizes with a chromosomal or 
non-chromosomal nucleotide sequence of a Gram-positive strain resistant 
to glycopeptides, in particular to vancomycin and/or teicoplanin , in 
particular in that it hybridizes with a chromosomal or non-chromosomal 
nucleotide sequence of a strain of Gram-positive cocci, for example 
an enterococcal strain and preferably E. faecium 4147 or E. gallinarum . 

In order to distinguish strains with a high level of 
resistance from strains with a low level of resistance it is possible 
to carry out a hybridization test using conditions of high stringency* 
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The oligonucleotides of the invention may be obtained from 
the sequences of the invention by cutting with restriction enzymes, 
or by chemical synthesis according to the standard methods. 

Furthermore, the invention relates to polyclonal or monoclonal 
antibodies, characterized in that they recognize the polypeptide(s) 
described above or an amino acid sequence described above. 

These antibodies may be obtained according to standard methods 
for antibody production. In particular, in the case of the preparation 
of monoclonal antibodies, recourse will be had to the method of Kohler 
and Milstein according to which monoclonal antibodies are prepared 
by cell fusion between myeloma cells and mouse spleen cells previously 
immunized with a polypeptide or a composition according to the 
invention, in conformity with the standard procedure. 

The antibodies of the invention can advantageously be used 

for the detection of the presence of proteins characteristic of 

resistance to the glycopeptides, in particular to vancomycin and 
teicoplanin. 

Particularly useful antibodies are polyclonal or monoclonal 
antibodies directed against the protein VanA or VanC. Such antibodies 
advantageously make it possible to detect strains of bacteria, in 
particular Gram-positive cocci, exhibiting high-level resistance to 
the antibiotics of the glycopeptide family. If necessary, a step 
entailing lysis of the cells of the sample undergoing detection is 
performed prior to the placing in contact of the sample with the 
antibodies. 

In order to carry out this detection, recourse will 
advantageously be had to antibodies labelled for example with a 
radioactive substance or other type of label. 

Hence, tests for the detection in Gram-positive bacteria 
of resistance to the glycopeptides, in particular tests making use 
of the ELISA procedures, are included in the framework of the invention. 

A kit for the in vitro diagnosis of the presence of Gram- 
positive strains, resistant to the glycopeptides, in particular to 
vancomycin and/or teicoplanin, these strains belonging in particular 
to the Gram-positive cocci for example enterococci, for example 
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f aecium or E. gallinarum is characterized in that it comprises: 

- antibodies corresponding to the above definition, labelled if 
necessary , 

- a reagent for the detection of an immunological reaction of the 
antigen-antibody type, 

- if necessary, reagents to effect the lysis of the cells of the sample 
to be tested. 

Furthermore, the agents developed by the inventors offer 
the very useful advantage of being suitable for the development of 
a rapid and reliable test or kit for the detection of Gram-positive 
strains resistant to the glycopeptides by means of the polymerase chain 
reaction (PCR) . Such a test makes it possible to improve the sensitivity 
of the existing tests which remain rather unreliable and, in certain 
cases, may make possible the detection of all of the representatives 
of the family of the genes coding for resistance proteins to the 
glycopeptides in Gram-positive bacteria. 

The carrying out of a test by means of the method of 
amplification of the genes of these proteins is done by the PCR 
procedure or by the RPCR procedure (RPCR : abbreviation for reverse 
polymerase chain reaction). 

The RPCR technique makes possible the amplification of the 
NH 2 and COOH terminal regions of the genes it is desired to detect. 

Some specific primers make it possible to amplify the genes 
of the strains with low-level resistance. These primers are selected, 
for example, from the sequence coding for the resistance protein VanA. 

As examples, the following sequences can be used as primers 
for the preparation of probes for the detection of an amplification 
by means of the PCR or RPCR method. 
VI : GGX GAA GAT GGX TCX TTX CAA GGX 
G C AG C G 
A 



V2 : AAT ACX ATX CCX GGX TTT AC 
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X represents one of the bases A,T,C or G or also corresponds 
in all cases to inosine. 

Naturally, the invention relates to the complementary probes 
of the oligonucleotides previously described as well as possibly to 
the RNA probes which correspond to them. 

A kit for the in vitro diagnosis of the presence of strains 
of Gram-positive bacteria resistant to the glycopeptides, in particular 
resistant to vancomycin and/or teicoplanin these strains belonging 
in particular to the Gram-positive cocci, in particular that they are 
strains of enterococci, for example E« faecium or E« gallinarum , is 
characterized in that it contains: 

- a nucleotide probe complying with the above specifications and if 
necessary, 

- oligonucleoside triphosphates in an amount sufficient to make 
15 possible the amplification of the desired sequence, 

- a hybridization buffer, 

- a DNA polymerization agent. 

The invention also relates to a procedure for the in vitro 
detection of the presence of Gram-positive strains resistant to the 
20 glycopeptides, in particular to vancomycin and/or teicoplanin. these 

strains belonging in particular to the family of the Gram-positive 
cocci, in particular in that they are strains of enterococci, for 
example E. faecium or E« gallinarum , characterized in that it comprises: 

a) the placing of a biological sample likely to contain the resistant 
25 strains in contact with a primer constituted by a nucleotide 

sequence described above, or any part of a sequence previously 
described, capable of hybridizing with a desired nucleotide sequence 
necessary for the expression of resistance to the glycopeptides, 
this sequence being used as matrix in the presence of the 4 
30 different nucleoside triphosphates and a polymerization agent under 

conditions of hybridization such that for each nucleotide sequence 
which has hybridized with a primer, an elongation product of each 
primer complementary to the matrix is synthesized, 

b) the separation of the matrix from the elongation product obtained, 
35 this latter then also being capable of behaving as a matrix, 
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c) the repetition of step a) so as to produce a detectable amount 
of the desired nucleotide sequences, 

d) the detection of the product of amplification of the nucleotide 
sequences. 

The detection of the elongation products of the desired 
sequence may be carried out by a probe identical with the primers used 
to carry out the PCR or RPCR procedure, or also by a probe different 
from these primers, this probe being labelled if necessary. 

Details relating to the implementation of the PCR procedures 
may be obtained from the patent applications EP 0229701 and EP 0200362. 

Other advantages and characteristics of the invention will 
become apparent in the examples which follow and from the figures. 

FIGURES 

- Figure 1 : electrophoresis on SDS-polyacrylamide gel (SDS-PAGE) of 
the proteins of the membrane fractions line 1 and line 4 f molecular 
weight standards; line 2, E. faecium BM4147 placed in culture in the 
absence of vancomycin; line 3, EM4147 placed in culture in the presence 
of 10 Mg/ml of vancomycin. The head of the arrow indicates the position 
of the VanA protein. 

- Figure 2 : 

A : Restriction maps of the inserts of the plasmids pAT213 and pAT214. 
The vector and the DNA insert are distinguished by light and dark 
segments, respectively. The open arrow represents the vanA gene. 
B : Strategy for the nucleotide sequencing of the insert of 1761 bp 
in the plasmid pAT214. The arrows indicate the direction and extent 
of the sequencing reactions by the dideoxy method. The synthetic 
oligonucleotide primer (5 T ATGCTCCTGTCTCCTTTC 3 1 OH) is complementary 
to the sequence between the positions 361 and 378. Only the pertinent 
restriction sites are given. 

- Figure 3 : position of the sequences R, S, 0RF1, 0RF2, 0RF3. 
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- Figure 4 : representation of SEQ ID NO 6. 

" Figure 5 : representation of SEQ ID NO 6 and the corresponding 
protein. 

- Figure 6 : sequence of the vanA gene and the corresponding protein. 

- Figure 7 ; 

(a) : Localization of the genes vanR, vanS, vanH, vanA, vanX, vanY, 
vanZ of the gene for the transposase and of the gene for the resolvase 
as well as the repeated reverse terminal sequences of 38 bp at the 
end of the transposon. 

(b) : Mapping of the plasmids. (A) Polylinker pAT29 and derivatives 
constructed in this study. The arrow labelled P2 indicates the position 
and orientation of the P2 promoter of aphA-3 (Caillaud et al. t 1987, 
Mol. Gen. Genet. 207:509-513). (B) Insert pAT80. The white rectangles 
indicate the DNA of pAT29 but they are not shown to scale. The 
rectangles terminating in an arrow indicate the coding sequences. The 
arrows shown in vertical and horizontal full lines indicate the position 
and orientation, respectively, of the apha-1 gene in the derivatives 

of pAT80. Restriction sites: Ac, AccI ; B, BamHI; Bg, BglH; Bs, BssHII; 
E, EcoRI; H, Hindlll; He, Hindi; K, Kpnl ; P, PstI; S, Smal; SI, SacI, 
SII, SacII; Sa, Sail ; Sp, SphI ; Xb, Xbal . (C) Inserts in pAT86, pAT87, 
pAT88 and pAT89. The inserts are shown by full lines and the 
corresponding vectors are indicated in parentheses. 

- Figure 8 : nucleotide sequence of the transposon shown in Figure 7 
and amino acid sequence of the corresponding proteins. The nucleotide 
sequence is shown for the (+) strand and for the (-) strand 
(corresponding to the complementary sequence of the (+) strand for 

the positions 1 to 3189) on which the coding sequence of the transposase 
is located. 

" Figure 9 : Nucleotide sequence of the SacI-PstI fragment of 1347 

bp of the plasmid pAT216 containing the vanC gene. The numbering starts 
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at the first base G of the Sac I restriction site. The potential RBS 

sequence upstream from the initiation codon ATG of translation at 

position 215 is underlined. The STOP codon (TGA) is indicated by *. 

The region coding for the vanC and the deduced amino acid sequence 

5 are indicated in bold characters ♦ Sequential overlapping clones were 

generated by restriction fragments of subcloning of pAT216 in the 

bacteriophage M13mpl0 (Amersham, England). The universal primer (New 

England Biolabs Beverly MA) was used to sequence the insert in the 

recombinant phages. The sequencing was performed by the enzymatic 

10 dideoxy nucleotide method (Sanger et al., 1977 PNAS 74: 5463-5467) 

by using the T7 DNA polymerase (Sequenase US B CORP, Cleveland, OH) 
35 

and /*- S7dATP (Amersham, England). The reaction products were loaded 
onto 6% denaturing polyacrylamide gels. 

15 - Figure 10 : alignment of the amino acid sequences of VanC, VanA, 

DdlA and DdlB. The identical (I) amino acids and the conservative (C) 
substitutions in the 4 sequences are indicated in the alignment. In 
order to classify the conservative substitutions, the amino acids were 
grouped as follows: RK, LFPMVI, STQNC, AGW, H, ED and Y. The regions 

20 of high homology corresponding to the domains 1, 2, 3 and 4 are 

underlined. The sequences corresponding to the peptides 1 and 2 are 
indicated by the arrows. 

~ Figure 11 : description of the oligonucleotides VI and V2 (A) : Amino 
25 acid sequence of the peptides 1 and 2 of VanA and of the D-Ala-D-Ala 

ligases. The number of amino acids between the N-termimis and peptide 

1 , between the peptides 1 and 2 and the peptide 2 and the C-terminus 

is indicated. The identical amino acids between at least 2 of the 3 

sequences are indicated in bold characters. 
30 (B) : Target peptides and deduced nucleotide sequence. X represents 

any base of the DNA. Peptide 2 in DdlB differs from the target peptide 

at 2 positions (*). 

(C) : Nucleotide sequence of VI and V2. Alternate nucleotides and 
deoxyinosine (I) which may correspond to any base in the DNA, were 
35 used at the positions at which the nucleotide sequences coding for 
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the target peptides vary. The arrows indicate the direction of DNA 
synthesis. The oligonucleotides were synthesized by the raethoxy- 
phosphoramidite method with a Biosystem DNA 380B machine (Applied 
Biosystem, Foster City, Ca). The DNA was isolated from bacterial lysates 
by extraction with hexadecyl trimethyl ammonium bromide (Inst, 
biotechnologies. Inc., New Haven, CO) (Le Bouguenec et al., 1990, J. 
Bacteriol. 172 :727-734) and used as matrix for the amplification by 
means of PCR with a controlled heating system "Intelligent Heating 
Block" IBH101 (Hybarid Ltd., GB) according to the description of Mabilat 
et al. (1990, Plasmid 23:27-34). The amplification products were 
revealed by electrophoresis on a 0.8% gel, after staining with ethidium 
bromide . 

- Figure 12 : Inactivation by insertion of vanC. The vanC gene is shown 
by an open arrow and the internal EcoR I -Hinc II fragment of 690 bp is 
hatched. The DNA of p ATI 14 is shown by a thin line; the chromosomal 
DNA of PM4174 by a thick line; the arrows indicate the genes for 
resistance to the antibiotics: aphA-3 is the gene coding for the 3'- 

R 

aminoglycoside phosphotransferase; erm is the gene coding for the ER 
methyl transferase. 

(A) : The plasmid pAT217 was constructed by ligation of the EcoRI -Hinc II 
fragment of pAT216 to the suicide vector pAT114 (Trieu-Cuot et al., 
1991, Gene 106:21-27), digested with EcoRI and Smal. 

(B) : vanC region of the chromosomal DNA of BM4174. 

(C) : vanC region after integration of pAT217. 

" Figure 13 : Southern blot analysis of the integration of pAT217 into 
the vanC gene of BM4174. 

(left hand side) : Total DNA of BM4175 (line 2) and BM4174 (line 3) 
digested with EcoR I and resolved by means of electrophoresis on a 1% 
agarose gel. The DNA of the bacteriophage lambda digested with PstI 
was used as molecular mass standard (line 1). The DNA was transferred 
under vacuum to a Nytran membrane (Schleicher and Schiil, Germany) by 
using a Trans-Vac TE80 apparatus (Hofer Scientific Instruments, San 
Francisco, CA) and bound to the membrane through the intermediary 
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of UV light. The hybridization was carried out with the probe C (Middle) 

or the probe aphA-3 specific for pAT114 (Lambert et al., 1985, Annales 

de rinstitut Pasteur/Microbiol . 136(b): 135-150) . 

32 

(right hand side): the probes were labelled with P by nick 
translation. The molecular masses (kb) are indicated. 

" Figure 14 : alignment of the deduced amino acid sequences of VanS 
derived from E. faecium BM4147 and of PhoR and EnvZ from E.coli . The 
numbers on the left refer to the position of the first amino acid in 
the alignment. The numbers on the right refer to the position of the 
last amino acid of the corresponding line. The identical amino acids 
are placed in boxes. The dotted lines indicate gaps introduced in order 
to optimize their similarity. The dashes indicate the positions of 
the amino acid residues conserved in other HPK. The histidine residues 
in bold characters in section 1 are potential sites of auto- 
phosphory lation . 

~ Figure 15 : alignment of the deduced amino acid sequences of VanR 
from E. faecium BM4147, QmpR and PhoB from E. coli as well as that 
of CheY from Salmonella typhimurium . The numbers on the right indicate 
the position of the last amino acid of the corresponding line. The 
identical amino acids are placed in boxes. The dotted lines indicate 
the gaps introduced in order to optimize the homologies. The residues 
in bold characters correspond to the amino acids strongly conserved 
in the effector domains of other RR. The aspartic acid residue 57 of 
CheY is phosphorylated by the HPK associated with CheA. 
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I - IDENTIFICATION OF vanA 

Materials and methods for the identification and characterization 
of the vanA gene 

Bacterial strains and plasmids 

The origin of the plasmids used is given in the table below. 
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Strain or plasmid 



Source or reference 



Escherichia coli 
JM83 
AR1062 
15 JM103 
ST640 

Enterococcus faecium 



Messing (1979) 

Rambach and Hogness (1977) 

Hannshan (1983) 

Lugtenberg and van Schijndel 

van-Dam (1973) 



20 BM4147 

Plasmid pUC18 

pAT213 

pAT214 



Leclercq et al (1988) 
Norrander et al (1983) 
Brisson-Noel et al (1990) 
Described in this text 
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Preparation of the enterococcal membranes 



30 



35 



Enterococcus faecium BM4147 was cultivated in 500 ml of heart- 
brain broth (BHI broth medium) until the optical density (0Dg Q0 ) reached 
0.7. Induction was effected with 10 ng/ml of vancomycin (Eli Lilly 
Indianapolis Ind). The subsequent steps were performed at 4°C. The 
cells were recovered by centrif ugation for 10 minutes at 6000 g, washed 
with a TE buffer (0.01 M TRIS-HC1, 0.002 M EDTA, pH 7.0) and lysed 
by glass beads (100 urn in diameter) in a Braun apparatus for 2 minutes. 
The cell debris were separated by centrif ugation for 10 minutes at 
6000 g. The membranes were collected by centrif ugation for 1 hour at 
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65000 g and resuspended in 0.5 ml of TE buffer. 



Preparation of the minicells 

5 

Plasmids were introduced by transformation into the strain 
E. coli AR1062 prepared in the form of bacterial vesicles. The bacterial 
vesicles were recovered on sucrose gradients and the proteins were 
labelled with 50 *iCi of /^S7-L-methionine (^erg^a^ Great Britain) 
10 according to the method of Rambach and Hogness (1977, P.N.A.S. USA, 

74; 5041-5045). 

Preparation of the membrane fractions and the cytoplasmic 
15 fractions of E. coli 

E. coli JM83 and strains derived from it were placed in 
culture in BHI medium until an optical density (0D &00 ) of 0.7 was 
attained, washed and suspended in a TE buffer. The cell suspension 

20 was treated by sonication (ultrasound) for 20 seconds at doses of 50 

W in a cell fragmentation apparatus in a Branson B7 sonication apparatus 
and the intact cells were removed by centrifugation for 10 minutes 
at 6000 g. The supernatant was fractionated into membrane and 
cytoplasmic fractions by means of centrifugation for 1 hour at 100,000 

25 g. 

Electrophoresis on SDS-polyacrylamide gel (SDS-PAGE) 

The proteins from the bacterial fractions were separated 
30 by means of SDS-PAGE on linear gradients of polyacrylamide gels (7.5% 

- 15%) (Laemmli 1970, Nature 227 : 680-685). The electrophoresis was 
carried out for 1 hour at 200 V, then for 3 hours at 350 V. The gels 
were stained with Coomassie blue. The proteins of the extracts were 
separated on 10% polyacrylamide gels and visualized by means of 
35 autoradiography ♦ 
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Purification of the protein band and determination of the 
N-terminal sequence 

The proteins of the membrane fractions of an induced culture 
^ °f E. faecium BM4147 were separated by means of SDS-PAGE. The gel was 

electrotransferred for 1 hour at 200 mA to a polyvinylidene difluoride 
membrane (Immobilon Transfer, Millipore) by using a transfer apparatus 
(Electrophoresis Unit LKB 2117 Multiphor II) in accordance with the 
instructions of the manufacturer. The transferred proteins were stained 
10 with Ponceau red. The portion of membrane bearing the protein of 

interest was excised, centered on a Teflon filter and placed in the 
cartridge of a sequencer (Sequencer Applied Biosystems model 470A). 
The protein was sequenced by means of the automated Edman degradation 
(1967, Eur. J. Biochem. 1; 80-81). 
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Construction of plasmids 



The plasmid pAT213 (Brisson-Noel et al., 1990, Antimicrob. 
Agents Chemother., 34; 924-927) consists of a EcoR I fragment of DNA 
20 of 4.0 kb of the enter ococcal plasmid pIP816 cloned at the EcoR I site 

of a Gram-positive-Gram-negative shuttle vector pAT187 (Trieu-Cuot 
et al., 1987, FEMS Microbiol. Lett. 48; 289-294). In order to construct 
pAT214, the EcoR V -SacI I DNA fragment of 1761 bp of pAT213 was purified, 
treated with the Klenow fragment of the DNA polymerase I of E. coli 
25 and ligated to the DNA of pUC18 which had previously been digested 

with Smal and dephosphorylated (Figure 2). The cloning (Maniatis et 
al., 1982 Cold Spring Harbor Laboratory Press) was carried out with 
restriction endonucleases (Boehringer Mannheim and Pharmacia), with 
the T4 DNA ligase (Pharmacia) and alkaline phosphatase (Pharmacia) 
according to the instructions of the manufacturer. 
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Subcloning in M13 and nucleotide sequence 



The DNA restriction fragments were subcloned in the polylinker 
5 of the replicative forms of the derivatives mpl8 and mpl9 of the 

bacteriophage M13 (Norrand # er et al., 1983, Gene 26; 101-106), obtained 
from Pharmacia P-L Biochemicals . E.coli JM103 was transfected with 
recombinant phages and the single-stranded DNA was prepared* The 
nucleotide sequencing was carried out by the enzymatic di-deoxy 
10 nucleotide method (Sanger et al., 1977, P.N.A.S. USA 74; 5463-5467) 

by using a T7 DNA polymerase (Sequenase, United States Biochemical 
Corporation, Cleveland, Ohio) and ^- 35 S/dATP (Amersham, Great Britain). 
The reaction products were revealed on 6% polyacrylamide gels containing 
a denaturing buffer, 

15 

Data-processing analysis and data on the sequence 

The complete DNA sequence was assembled by using the computer 
20 programs DBCOMP and DBUTIL (Staden, 1980, Nucleic Acids Res 8; 3673- 

3694). The protein data bank PSEQIP of the Pasteur Institute was 
screened using an algorithm developed by Claverie (1984, Nucleic Acids 
Res 12; 397-407). The alignments between the pairs of amino acid 
sequences were constructed using the algorithm of Wilbur et al (1983, 
25 P.N.A.S. USA 80; 726-730). The statistical significance of the homology 

was evaluated with the algorithm of Lipman and Pearson (1985, Science 
227; 1435-1440). 

For each comparison 20 amino acid sequences were used to 
calculate the mean values and the standard deviations of the random 
30 results. 
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Genetic complementation tests 

The plasmids were introduced by transformation into E.coli 
ST64Q, a temperature-sensitive mutant with an unmodified D-ala-D-ala 
ligase (Lugtenberg et al 1973, J. Bacteriol 110; 26-34). The 
transformants were selected at 30°C on plates containing 100 *ig/ml 
of ampicillin and the presence of the plasmid DNA of the expected size 
and the restriction maps were verified • Single colonies grown at 30°C 
in BHI broth medium containing ampicillin were placed on a BHI agar 
medium containing both 100 *tg/ml of ampicillin and 50 iiM of isopropyl- 
1-thio-B-D-galacto-pyranoside (IPTG) and the plates were incubated 
at a permissive temperature of 30°C and at a non-permissive temperature 
of 42°C. The complementation test was considered to be positive if 
the colonies were present after 18 hours of incubation at 42°C* 



RESULTS 

Identification of the VanA protein and its N-terminal sequence 

The membrane fractions of the E« faecium BM4147 cells placed 
in culture, on the one hand, under conditions of induction, and, on 
the other, in the absence of induction, were analysed by means of SDS- 
PAGE. The sole difference which could be detected, related to the 
exposure to sub-inhibitory concentrations of vancomycin^ was the marked 
intensification of a band which corresponded to a protein of an 
estimated molecular weight of about 40 kDa. In the induced cells and 
in the non-induced cells, the protein band represents the same protein 
because this band is absent from membranes of a derivative of BM4147 
which has lost the pIP816 plasmid. The inducible protein, designated 
as VanA, was purified after SDS-PAGE and automated Edman degradation 
was carried out on a 50 pmol. sample. Nine amino acids of the N-terminal 
sequence of VanA were identified: Met Asn Arg lie Lys Val Ala lie Leu. 
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Sub-cloning of the vanA gene 

The insert of 4.0 kb of the plasinid pAT213 bears the 
determinant for resistance to the glycopeptides of E. faecium BM4147. 
Various restriction fragments of this insert were subcloned in pUC18 
and the recombinant plasmids specific for vanA in E. coli were 
identified by SDS-PAGE analysis of the proteins of the cytoplasmic 
and membrane fractions or of the extracts of the bacterial vesicles. 
This approach was used since E. coli is intrinsically resistant to 
the glycopeptide. The EcoR V -SacI I insert of the pAT214 plasmid (Figure 
2) codes for a unique polypeptide of 40 kDa which migrates together 
with VanA, derived from the membrane preparations of E. faecium BM4147. 

Nucleotide sequence of the insert in pAT214 and identification 
of the vanA coding sequence 

The nucleotide sequence of the EcoRV-SacII insert of 1761 
bp in pAT214 was determined on both strands of the DNA according to 
the strategy described in Figure 2, The location of the termination 
codons (TGA, TAA t TAG) in three reading frames on each DNA strand showed 
the presence of a unique open reading frame (ORF) which was sufficiently 
long to code for the VanA protein* This reading frame ORF is located 
between the TAA codon at position 281 and the TAG codon at position 
1406. The amino acid sequence deduced for ORF was compared with that 
of the N-terminus of VanA. The nine amino acids identified by protein 
sequencing are encoded in the nucleotide sequence beginning with the 
ATG (methionine) codon at position 377 (Figure 3). This codon for the 
initiation of translation is preceded by a sequence (TGAAAGGAGA) , 
characteristic of a ribosoraal binding site (RBS) in Gram-positive 
bacteria which is complementary to the 8 bases of the rRNA of the 16S 
subunit of Bacillus subtilis in its sequence (3'0H UCUUUCCUCC 5 f ) (Moran 
et al., 1982, Mol. Gen. Genet. 186; 339-346). In this ORF, there is 
no other ATG or GTG initiation codon between the positions 281 and 
377. The sequence of 1029 bp which extends from the ATG codon at 
position 377 to the TGA codon at position 1406 codes for a protein 
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containing 343 amino acid residues. The calculated molecular weight 
of this protein is 37400 Da, which is in agreement with the estimation 
of 40 kDa obtained by SDS-PAGE analysis. 



Homology of the amino acid sequences of VanA and the D-ala- 
D-ala lipase enzymes 



The screening of the protein data bank PSEQIP has shown the 
10 existence of a sequence homology between VanA and the D-ala-D-ala 

ligases of E.coli (ECOALA, Robinson et al. f 1986, J. Bacteriol. 167; 
809-817) and of Salmonella typhimurium (DALIG, Daub et al. t 1988, 
Biochemistry 27; 3701-3708). The calculated percentage of homology 
between pairs of proteins was included between 28% and 36% for the 
15 identical amino acids and between 48% and 55% by taking into 

consideration homologous amino acids. VanA and DALIG are more closely 
related. The statistical significance of these similarities wa evaluated 
by aligning VANA and sequences containing the same composition of amino 
acids as DALIG or ECOALA (Lipman and Pearson, 1985, Science 227; 1435- 
20 1440). 



Genetic complementation test for the activity of D-ala-D- 
ala ligase 

25 

The E.coli strain ST640 is a thermosensitive mutant exhibiting 
a deficient D-ala-D-ala ligase activity (Lugtenberg et al., 1973, J. 
Bacteriol. 113: 96-104). The plasmids pUC18 and pAT214 were introduced 
into E.coli ST640 by transformation. The strains ST640 and ST640 (pUC18) 
30 grew normally only at the permissive temperature (30°C) whereas E.coli 

ST640 (pAT214) grew both at the permissive temperature and at the non- 
permissive temperature (42°C). 
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This test shows that VANA is functionally related to the 
D-Ala-D-Ala ligases in E.coli and is probably capable of catalysing 
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the same ligation reaction as DALIG. 

II - VanS-VanR two-component regulation system for the control of the 
synthesis of depsipeptides of the precursor of p eotidoglycans 

MATERIALS AND METHODS 

Strains, plasmids and conditions of culture 

The restriction fragments of pIP816 (Tra~, Mob + , Vm r ) were 
cloned in derivatives of the vector pAT29 which constitutes a shuttle 
vector between the Gram-positive and Gram-negative bacteria (oriR pAMBl, 
oriR pUC, oriT RK2, spc, lacZ ) (Trieu-Cuot et al., 1990, Nucleic Acids 
Res. 18:4296). This vector was constructed by the inventors and used 
to transform the strain E.coli JM103 ( (lac-proAB), supE, thi, strA, 
sbcB15, endA, hspR4, F traD36, proAB, Lacl q , lacZ M15) (Messing et 
al., 1983, Methods Enzymol. 101 :20-78). The plasmid DNA was prepared 
by an alkaline lysis protocol on a small scale (Sambrook et al., 1982, 
Molecular cloning, a laboratory manual. Cold Spring Harbor Laboratory, 
Cold Spring Harbor NY) and introduced by electroporation (Cruz-Rodz 
A.L. et al., 1990, Mol. Gen. Genet. 224: 152-154) in E.faecalis JH2- 
2 (Fus R , Rif R ) (Jacob A.E. et al., 1974, J. Bacterid. 117: 360-372), 
by using a Gene Pulser apparatus (Bio-Rad Laboratories, Richmond, 
California). The restriction profiles of the purified plasmids from 
E. faecalis and E. coli were compared in order to detect possible 
rearrangements of DNA. 

The integrative plasmid pAT113 (Mob + , Em R , Km R , oriR PACYC184, 
attTnl545, LacZ ) (Trieu-Cuot et al., Gene 106: 21-27) carries the 
joined ends of the transposon Tnl545. This vector does not replicate 
in Gram-positive bacteria but is integrated into the chromosome of 
the host by illegitimate recombination mediated by the integrase of 
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Tnl545 or of Tn916 (Trieu-Cuot et al. previously mentioned). The 
integrative plasmids were introduced into E. faecalis BM4148 (strain 
JH2-2: :Tn916) by means of electroporation. This strain is modified 
by the transposon Tn917 described by Franque A.E. et al. (1981, J. 
5 Bacteriol. 145 : 494-502). 

The cultures were grown in brain-heart broth (BHI - Brain 
Heart Infusion Broth) or on agar at 37°C. The method of Steers et al 
(Antibiot. Chemother. Basel. 9: 307-311) was used to determine the 
10 minimal inhibitory concentrations (MICs) of the antibiotics on a 

Mueller-Hinton gelose agar medium. 



Recombinant DNA procedures 

15 

The cleavage of DNA with restriction endonucleases (Boehringer 
Mannheim and Pharmacia), the purification of the DNA restriction 
fragments from agarose gels, the conversion of the cohesive ends to 
blunt ends with the Klenow fragment of the DNA polymerase I of E.coli 

20 (Boehringer Mannheim), the dephosphorylation of the ends of the DNA 

with calf intestinal phosphatase (Boehringer Mannheim), the ligation 
of the DNA fragments with the T4 DNA ligase (Amersham) were carried 
out according to the standard methods of Sambrook et al (1982, Molecular 
Cloning, a Laboratory Manual. Cold Spring Harbor Laboratory. Cold Spring 

25 Harbor NY). 



Construction of plasmids 



30 The origin of the vectors and the inserts used for the 

recombinant plasmids constructed here is the following: 
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(i) vector pAT78 for the recognition of the promoter: the 
amplified DNA of the cat gene for chloramphenicol 
acetyltransferase of the plasmid pC194 of Staphylococcus 
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aureus (Horinouchi et al., 1982, J. Bacterid. 150 : 815- 
825) was inserted between the PstI and SphI restriction sites 
of the shuttle vector pAT29. Amplification by means of the 
polymerase chain reaction was carried out by means of primers 
Al and A2 which were synthesized by the methoxy phoshoramidite 
method (Mabilat et al,, 1990, Plasmid 23: 27-34). The sequence 
of the primer Al (5* GCTGCAGA TAAAAATTTAGGAGG ) is composed 
of a PstI recognition site (underlined) and 18 bases 
(positions 6 to 23) of pC194 which include the ribosomal 
binding site (RBS ; AGGAGG positions 18 to 23) of the cat 
gene. The sequence of the primer A2 (5* CGCATGCT ATTATAAAA 
GCCAGTC) contains the SphI cleavage site (underlined) and 
is complementary (positions 8 to 24) to 17 bases at the 3' 
end of the cat gene. The triplet ATT at positions 9 to 11 
correponds to the TAA stop codon of cat. The DNA fragments 
amplified with the primers Al and A2 hence consist of an 
open reading frame (orf) and a ribosomal binding site for 
CAT (positions 1234 to 1912 according to the numbering of 
Horinouchi et al. (1982, J. Bacterid. 150: 815-825) flanked 
by the PstI and SphI sites. The position 1234 is located 
at the interior of the loop of the secondary structure of 
the mRNA which blocks translation in the absence of 
chloramphenicol. Thus, the amplified sequence does not 
contain the cat promoter nor the sequence complementary to 
the RBS which is essential for the regulation of translation 
Ambulos, N.P. et al., 1984, Gene 28: 171-176). 

(ii) expression vector pAT79: the Clal-BssH II fragment of 
243 bp bearing the P2 promoter of the aphA-3 gene of the 
enterococcal plasmid pJHl (Caillaud et al., 1987, Mol. Gen. 
Genet. 207 : 509-513) was inserted between the EcoRI and Sac I 
restriction sites of pAT78. 

(iii) plasmid pAT80 and its derivatives: the Bglll-Xbal 
fragment of 5.5 kb of pIP8l6 was inserted between the BamHI 
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and Xbal sites of pAT78. The resulting plasmid, designated 
as pAT80 was partially digested with Hind i and ligated with 
the EcoR V fragment containing a gene related to the apha-I 
gene of the transposon Tn903 (Oka A* et al. f 1981, J. Mol. 
Biol* 147 :217-226, This fragment contains the aphA-I gene 
which codes for the 3 1 aminoglycoside phosphotransferase of 
type I conferring resistance to kanamycin. The insertion 
of aphAI wa s carried out at three different sites in pAT80, 
generating the plasmids pAT81, pAT83 and pAT85, The cassettes 
BamH I and EcoR I containing aphA-I were inserted at the BamH I 
(to form the plasmid pAT84) and EcoR I (to form the plasmid 
pAT82) sites of pAT80, 

(iv) plasmids pAT86, pAT87, pAT88 and pAT89: the plasmid 
pAT86 was constructed by cloning the EcoR I -SacI I fragment 
of 2.803 bp of pAT80 coding for VanH and VanA at a Smal site 
of pAT79. pAT87 was obtained by inserting the EcoRI-Xbal 
fragment of 3*4 kb of pAT80 upstream from the cat gene of 
the detection vector of promoter pAT78, The plasmid pAT88 
resulted from the ligation of pAT78 digested with EcoR I and 
BamH I to the EcoRI-BamHI fragment of 1,731 bp of pAT80. The 
Bglll-AccI fragment (positions 1 to 2356) of pAT80 was 
inserted into the polylinker of the integrative vector pAT113, 
generating pAT89. 

Sub-cloning in Ml 3 and sequencing 

The DNA restriction fragments were subcloned in a polylinker 
of replicative derivatives of the bacteriophage M13, these derivatives 
being called mp!8 and mpl9 (Norrander et al,, 1983, Gene 26:101-106), 
E.coli JM103 was transfected with the recombinant phages and a single- 
stranded DNA was prepared. The sequencing of the nucleotides was carried 
out according to the conditions described by Sanger et al. (Proc. Natl, 
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Acad. Sci. USA, 1977, 74: 5463-5467) by using the modified T7 DNA 

polymerase (Sequenase, United States, Biochemical Corporation Cleveland 
35 

OH) and /St- $7dATP (Amersham). The reaction products were resolved 
on gradient gels of polyacrylamide in a 6% buffer. 



Enzymatic test 

The JH2-2 derivatives of E. faecalis were grown to an 
optical density OD^qq of OJ in a BHI broth supplemented with 
spectinomycin (300 ng/ml). The cells were treated with lysozyme, lysed 
by sonication and the cell debris were centrifuged for 45 minutes at 
100,000 g according to the description given by Courvalin et al. (1978, 
Antimicrob. Agents Chemother, 13:716-725), The formation of 5-thio- 
2-nitrobenzoate was measured at 37°C in the presence and in the absence 
of chloramphenicol and the specific CAT activity was expressed in 
micromole per minute and per milligram of proteins (Shaw et al., 1975, 
Methods Enzymol. 43:737-755). 



RESULTS 

The vanH and vanA genes of pIP816 were cloned in a plasmid 
pAT79 under the control of the heterologous promoter P2 (Caillaud et 
al., 1987, Mol. Gen. Genet. 207 :509-513) and the plasmid pAT86 formed 
did not confer resistance to vancomycin on the strain E. faecalis JH2- 
2. These genes are thus not sufficient for the synthesis of peptoglycan 
in the absence of the antibiotic. Different restriction fragments of 
pIP816 were cloned in the vector pAT78. The Bglll-Xbal fragment of 
5.5 kb of pAT80 is the smallest fragment obtained which conferred 
resistance to vancomycin. 



34 



Nucleotide sequence of the vanR and vanS genes 

The sequence of the insert in pAT80 was determined on both 
strands of the DNA from the Bglll site to the ATG initiation codon 
for the translation of VanH* Two open reading frames (orf) were detected 
within the sequence of 2475 bp; the first open reading frame extends 
from the nucleotide 386 to the nucleotide 1123; at position 431 a 
sequence characteristic of the RBS sequences in Gram-positive bacteria 
is found, 6 base pairs upstream from the ATG initiation codon for 
translation (TGAAAGGGTG) ; the other initiation codons for translation 
in this orf are not preceded by this type of sequence. The sequence 
of 693 bp extending from the ATG codon at position 431 to the TAA codon 
at position 1124 is capable of coding for a protein of 231 amino acids 
with a molecular mass of 26,612 Da which is designated as VanR. 

In the case of the second open reading frame (from nucleotide 
1089 to nucleotide 2255) the amino acid sequence deduced from the first 
initiation codon in phase (TTG at position 1104) would code for a 
protein of 384 amino acids having a molecular mass of 43,847 Da and 
designated as VanS. The TTG codon at position 1116 and the ATG codon 
at position 1164 are in— phase initiation codons for translation preceded 
by sequences with low complementarity with the 3* OH terminus of the 
16S sub-unit of the rRNA of B. subtilis ( GGGGGGTTGG-N8-TTG and 
AGAAC GAAAA-N6-ATG , respectively ) . 

Between the last codon of vanS and the initiation codon ATG 
for the translation of vanH a sequence of 217 bp is to be observed 
which contains a repeated reverse sequence of 17 bp. This sequence 
does not function as a terminator of strong transcription. 

The comparison of the sequences obtained with data bases 
has shown that the conserved amino acid residues identified by Stock 
et al. (1989, Microbiol. Rev. 53:450-490) in the kinase domain of 16 
HPK (Histidine Protein Kinase) were detected in the C-terminal part 
of VanS. VanS possesses two groups of hydrophobic amino acids in the 
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N-terminal region. The histidine residue 164 of VAnS is aligned with 
the residue His216 of PhoR (Makino et al., 1986, J. Mol. Biol. 192: 
549-556) and His 243 of EnvZ (Comeau et al. f 1985, 164:578-584) which 
are presumed sites of autophosphorylation in these proteins. 

Similarly, the amino acids 1 to 122 of VanR exhibit similarities 
with the effector domains of response regulators RR. The aspartic acid 
53 of VanR might be a phosphorylation site because this residue is 
aligned with Asp 57 of Che Y which is phosphorylated by HPK associated 
with CheA and corresponds to an invariant position in other proteins 
of the RR type (Stock et al previously mentioned). VanR might belong 
to the sub-class OmpR-PhoB of RR which activates the initiation of 
transcription mediated by the RNA polymerase containing the 70S factor 
of E.coli (Stock et al. previously mentioned). 

Inactivation of the van genes by insertion 

Cassettes of resistance to kanamycin inserted in the group 
of van genes in the plasmid pAT80 have shown the following: the 
insertion in vanR suppresses resistance to vancomycin and 
chloramphenicol; VanR is an activator of transcription necessary for 
the expression of the genes for resistance to vancomycin. The 
inactivation of vanS leads to a two-fold reduction of the minimal 
inhibitory concentration (MIC) of chloramphenicol and to a three-fold 
reduction of the specific CAT activity but the minimal inhibitory 
concentration of vancomycin remains unchanged. Hence, VanS is necessary 
to produce a high level of transcription of the genes for resistance 
to vancomycin although it is not required for the expression of the 
phenotype of resistance to vancomycin. 

Derivatives of pAT80 bearing insertions in vanH (pAT83), 
vanA (pAT84) or in the region 1.0 kb downstream from vanA (pAT85) have 
made it possible to obtain resistance to chloramphenicol but not to 
vancomycin. This dissociated phenotype corresponds to the inactivation 
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of genes coding for enzymes which synthesize the depsipeptide precursors 
necessary for the assembly of the bacterial cell walls in the presence 
of vancomycin * 

Downstream from the vanA gene the presence of an inactivated 
orf has been detected in pAT85 in the region of the sequence of 365 
bp after the TGA codon of vanA and before the SacI I site and this orf 
contains an in-phase ATG initiation codon preceded by a RBS-like 
sequence. This sequence codes for a protein necessary for resistance 
to the glycopeptide, designated as VanX and which comprises maximally 
about 330 amino acids • 



Trans-activation of the transcription of the van genes 

The integrative plasmid pAT89 coding for VanR and VanS was 
introduced into the chromosome of E. faecalis BM4138. The plasmid 
pAT87 bearing the genes vanH, vanA and vanX cloned upstream from the 
cat gene lacking the promoter for pAT78 conferred resistance to 
vancomycin on this strain but not to E. faecalis JH2-2, The level of 
expression of the cat gene of pAT87 in the strains BM4138: :pAT89 and 
JH2 -2 indicated that VanR activates the transcription of the reporter 
gene localized at the 3* end of the group of van genes. Similar levels 
of CAT synthesis were observed for pAT88 which bears a transcription 
fusion between the 5 f parts of vanA and the cat gene* These results 
show that in E. faecalis BM4138: :pAT89 (pAT87) VanR and VanS encoded 
in the chromosome activate in a trans manner the transcription of vanA, 
vanH and vanX of pAT87 making possible the production of resistance 
to vancomycin. 

Moreover, it has been observed that the expression of the 
gene was essentially constitutive when vanR and vanS were borne by 
a multicopy plasmid pAT80 and weakly inducible by vancomycin when the 
genes for the regulatory proteins were present on the chromosome of 
the host. 
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III - Characterization of the sequence of the vanC gene of En terococcus 
sallinarum BM4174 

Definition and use of universal primers for the amplification 
of genes coding for D-Ala-D-Ala ligases and related p roteins 
implicated in resistance to vancomycin 

The protein VanA necessary for the expression of a high level 
of resistance to the glycopeptides in E. faecium BM4147 shares a 
similarity of about 28 to 36% as regards its amino acids with the D-Ala- 
D-Ala ligases of E.coli but possesses a different substrate specificity 
from that of these ligases. Peptides designated as 1 and 2 which are 
conserved in the sequences of the DdlA and DdlB ligases (Zawadzke, 
1991 Biochemistry 30:1673-1682) of E.coli and in the protein VanA were 
selected in order to synthesize universal, primers intended to amplify 
internal fragments of genes coding for D-Ala-D-Ala ligases or related 
enzymes. The peptide targets GEDG(S/T) (I/1)QG and NT(I/L)PGFT were 
translated back as is shown in Figure IV. 1 in order to obtain degenerate 
oligonucleotides VI and V2. As the peptides 1 and 2 of VanA, DdlA and 
DdlB are separated by amino acid sequences of similar length, the 
predicted size for the amplification product was about 640 bp. 

Amplification by means of PCR with the DNA of E.coli JM83 
and of E. faecium BM4147 made it possible to amplify products 
corresponding to the expected size which have then been purified and 
cloned in the bacteriophage M13mpl0 (Norrander et al. f 1983, Gene 
26:101-106). The sequencing of the insert obtained with E.coli JM83 
has shown that the product of PCR was an internal fragment of ddlA. 
A probe generated starting from a recombinant phage obtained with the 
amplification fragment of BM4147 was used for the Southern blot analysis 
of a DNA of BM4147 and BM4147-1 which is a derivative of BM4147 
sensitive to vancomycin and which lacks the plasmid pIP816 (Leclercq 
et al., 1988, N. Engl. J. Med. 319:157-161). The probe hybridized with 
the EcoR I DNA fragment of 4 kb from BM4147 but not with the DNA from 
E. faecium BM4147-1. As the vanA gene is borne by the EcoRI fragment 
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of 4 kb from pIP816, these results indicate that the primers also make 
possible the amplification of a part of vanA. Thus the oligonucleotides 
VI and V2 may amplify fragments of genes coding for different proteins 
related to the D-Ala-D-Ala ligases, and may do this in different 
species. 



Amplification , cloning and sequencing of the vanC gene 

10 Amplification by means of PCR was carried out on the total 

DNA of E. gallinarum BM4174 and the amplification product obtained 
of about 640 bp was cloned in the bacteriophage M13mpl0. The single- 
stranded DNA isolated from the recombinant phage was used to construct 
a probe C (Hu et al., 1982, Gene J7:2171-2177) . In Southern analysis 

15 the probe hybridized with a PstI fragment of 1.7 kb from BM4174 but 

not with the DNA of BM4147 and BM4147-1. 

The DNA of BM4174 was digested with PstI and fragments of 
1.5 and 2 kb were purified by electrophoresis on agarose gel and cloned 

20 in pUC18 (Norrander et al., 1983, mentioned previously). The recombinant 

plasmids were introduced into E.coli JM83 by transformation and screened 
by hybridization on colonies (Sambrook et al.* 1989, Molecular cloning, 
a laboratory manual. Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY) by using the probe C. A homology was detected with a 

25 transformant harbouring a plasmid called pAT216 which contained a PstI 

insert of 1.7 kb. The sequence of the SacI-PstI part of 1347 bp of 
the insert of pAT216 was determined on both strands of the DNA. The 
location of the termination codons in the three reading frames of each 
strand of DNA revealed the presence of an ORF phase located between 

30 the TGA codons at positions 47 and 1244. The initiation codon of 

transcription ATG at position 215 is preceded by a sequence GAAAGGAA GA 
characteristic of the RBS sequences complementary to the RNA of the 
1§S subunit of B. subtilis (Moran et al., 1982, Mol. Gen. Genet. 
186 :339-346). The sequence of 1029 bp which extends from the ATG codon 

35 at position 215 to the TGA codon at position 1244 might code for a 
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protein of 343 amino acids having a calculated molecular mass of 37504 
Da designated as VanC. A sequence homology was detected between VanC, 
VanA and the D-Ala-D-Ala ligases of E.coli . In particular, four domains 
of strong homology previously found between VanA and the D-Ala-D-Ala 
5 ligases of the enterobacteria are also present in VanC. The percentage 

of identical amino acids calculated for these proteins taken two at 
a time varied between 29 and 38%. The alignment of the four sequences 
revealed the presence of 57 invariant amino acids which include the 
conserved residues of the peptides 1 and 2 used to define the 
10 oligonucleotide probes VI and V2. 



Inactivation of the vanC gene by insertion 

15 In order to evaluate the contribution of vanC to resistance 

to vancomycin in E. gallinarum BM4174, the vanC gene was inactivated 
by insertion. A EcoR I -HincI I fragment of 690 bp, internal to vanC was 
cloned in pAT114 which does not replicate in Gram-positive bacteria. 
The resulting pAT217 plasmid was introduced into BM4174 by 

20 electroporation (Cruz-Rodz et al., 1990, Mol. Gen. Genet. 224; 152-154) 

and the clones supposed to result from a homologous recombination 
leading to the integration of pAT217 into vanC were selected on 
erythromycin. The clone BM4175 was compared with BM4174 by Southern 
hybridization using the probe C and aphA-3 specific for pAT114 # The 

25 two probes hybridized with the EcoR I fragment of 8.6 kb from BM4175. 

The probe C hybridized with a fragment of 2.5 kb from BM4174 whereas 
no signal was observed with the probe aphA-3 . The results indicate 
that the plasmid pAT217 of 6.1 kb was integrated into the vanC gene. 
The determination of the minimal inhibitory concentration of vancomycin 

30 for BM4174 (16 mg/1) and BM4175 (2 mg/1) indicated that the inactivation 

by insertion in vanC abolishes resistance to vancomycin. 

VanC is thus required for resistance to vancomycin. It may 
thus be supposed that this protein synthesizes a dipeptide or a 
35 depsipeptide which is incorporated into the precursors of peptido- 
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glycans and is not recognized by vancomycin. 

The sequences which are the object of the invention are given 
in the following pages after the list of the sequences containing the 
description of these sequences. In this list of the sequences, the 
proteins are identified with respect to the position of the nucleotide 
bases corresponding to the amino acids of the extremities of the 
proteins* 
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List of the sequences 



(contained in the sequences I (la, lb), II presented below or in the 
sequence shown in Figure 5). 

5 

Amino acid sequences 

SEQ ID NO 1 (VanH) : sequence of the first resistance protein, 
corresponding to the amino acid sequence of the open reading frame 
10 No, 3, starting at the base 3501 and terminating at the base 4529, 

containing the sequence coding for the vanH gene between the bases 
3564 and 4529 with respect to the sequence shown in Figure 5 or 
corresponding to the sequence between the positions of the nucleotides 
6018 and 6983 of the sequence la, 

15 

SEQ ID NO 2 (VanA) : sequence of the VanA protein, corresponding to 
the amino acid sequence of the open reading frame No, 1, starting at 
the base 4429 and terminating at the base 5553 with respect to the 
sequence shown in Figure 5 or corresponding to the sequence between 
20 the positions of the nucleotides 6977 and 7807 of the sequence la. 



SEQ ID NO 3 (VanX) : sequence of the third resistance protein, 
corresponding to the amino acid sequence of the open reading frame 
No, 3, starting at the base 5526 and terminating at the base 6167 with 
respect to the sequence shown in Figure 5 or corresponding to the 
sequence between the positions of the nucleotides 7816 and 8621 of 
the sequence la. 

SEQ ID NO 4 (VanR) : sequence of the regulatory protein R, corresponding 
to the amino acid sequence of the open reading frame No # 1, starting 
at the base 1477 and terminating at the base 2214 with respect to the 
sequence shown in Figure 5 or corresponding to the sequence between 
the positions of the nucleotides 3976 and 4668 of the sequence la. 
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SEQ ID NO 5 (VanS) : sequence of the sensor protein S, corresponding 
to the amino acid sequence of the open reading frame No. 2, starting 
at the base 2180 and terminating at the base 3346 with respect to the 
sequence shown in Figure 5 or corresponding to the sequence between 
5 the positions of the nucleotides 4648 and 5800 of the sequence la, 

SEQ ID NO 16 : sequence of the transposase corresponding to the amino 
acids included between the nucleotides 150 and 3112 of the sequence 
lb. 

10 

SEQ ID NO 17 : sequence of the resolvase comprising the amino acids 
situated between the positions of the nucleotides 3187 and 3759 of 
the sequence la. 

15 SEQ ID NO 18 : VanY sequence comprising the amino acids situated between 

the positions of the nucleotides 9046 and 9960 of the sequence la. 

SEQ ID NO 19 : VanZ sequence comprising the amino acids situated between 
the positions of the nucleotides 10116 and 10598 of the sequence la. 

20 

SEQ ID NO 20 : VanC amino acid sequence shown in list II. 



- Nucleotide sequences 

25 

SEQ ID NO 6 : nucleotide sequence containing the sequence coding for 
the 5 proteins as well as the flanking sequences, shown in Figure 5. 

SEQ ID NO 7 : sequence containing the sequence coding for the 3 
30 resistance proteins as well as the flanking sequences and starting 

at the base 3501 and terminating at the base 6167, shown in Figure 5. 
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SEQ ID NO 8 : sequence of the vanA gene, starting at the base 4429 
and terminating at the base 5553 of the sequence shown in Figure 5, 
or corresponding to the nucleotide sequence situated between the 
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nucleotides 6977 and 7807 of the sequence la. 

SEQ ID NO 9 : sequence coding for the first resistance protein called 
VanH, starting at the base 3501 and terminating at the base 4529, in 
particular the sequence vanH, the coding sequence of which is located 
between the bases 3564 and 4529 of the sequence shown in Figure 5, 
or corresponding to the nucleotide sequence situated between the 
nucleotides 6018 and 6983 of the sequence la. 

SEQ ID NO 10 : sequence coding for the third resistance protein VanX, 
starting at the base 5526 and terminating at the base 6167 of the 
sequence shown in Figure 5, or corresponding to the nucleotide sequence 
situated between the nucleotides 7816 and 8621 of the sequence la. 

SEQ ID NO 11 : sequence of the transposon coding for the transposase, 
the resolvase, vanR, VAnS, VanH, VanA, VanX, VanY and VanZ and 
containing the repeated reverse sequence of 38 bp at its N- and C- 
termini and corresponding to the sequence la* 

SEQ ID NO 12 : sequence coding for the transposase, starting at the 
base 150 and terminating at the base 3112 of the sequence lb. 

SEQ ID NO 13 : sequence coding for the resolvase, starting at the base 
3187 and terminating at the base 3759 of the sequence la. 

SEQ ID NO 14 ; sequence coding for VanY, starting at the base 9046 
and terminating at the base 9960 of the sequence la. 

SEQ ID NO 15 : sequence coding for VanZ, starting at the base 10116 
and terminating at the base 10598 of the sequence la. 

SEQ ID NO 21 : sequence coding for VanC, shown in the list II in 
relation to the protein VanC. 
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SEQ ID NO 22 : complete sequence la of the transposon of E. faecium , 
starting at the base 1 and terminating at the base 10851 . 

SEQ ID NO 23 : sequence coding for the protein VanR, starting at the 
base 3976 and terminating at the base 4668 of the sequence la, 

SEQ ID NO 24 : sequence coding for the protein VanS, starting at the 
base 4648 and terminating at the base 5800 of the sequence la. 
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CLAIMS 

1/ Composition of polypeptides, characterized in that it contains 
at least one protein or part of a protein selected from the amino acid 
5 sequences identified in the list of the sequences as SED ID NO 1 (VanH), 

SEQ ID NO 2 (VanA), SEQ ID NO 3 (VanX) or SEQ ID NO 19 (VanC) or any 
protein or part of a protein recognized by the antibodies directed 
against VanH, VanA, VanX or VanC or any protein or part of a protein 
encoded by a sequence hybridizing with one of the nucleotide sequences 

10 identified in the list of the sequences as SEQ ID NO 8, SEQ ID NO 9, 

SEQ ID NO 10 or SEQ ID NO 21 or with one of the following sequences 
VI or V2 under stringent or only slightly stringent conditions: 
VI ; GGX GAA GAT GGX TCX TTX CAA GGX 
G C AG C G 

15 A 

V2 : AAT ACX ATX CCX GGX TTT AC 
C T C 
C 

20 2/ Composition of polypeptides according to Claim 1, characterized 

in that it contains at least 3 proteins or any part of one or more 
of these proteins necessary to confer on Gram-positive bacteria 
resistance to antibiotics of the glycopeptide family, in particular 
to vancomycin and/or teicoplanin or to promote this resistance, in 

25 particular in strains of the family of the Gram-positive cocci, these 

proteins or parts of proteins being 

a) either recognized by antibodies directed against one of the 

sequences identified in the list of the sequences as SEQ ID NO 
1 (VanH), SEQ ID NO 2 (VanA), SEQ ID NO 3 (VanX), 
30 b) or encoded in genes containing a sequence identified as SEQ ID 

NO 8, SEQ ID NO 9 or SEQ ID NO 10 or hybridizing with one of these 
sequences or its complementary sequence or with the sequences VI 
or V2 under stringent or only slightly stringent conditions* 
3/ Composition of polypeptides according to Claim 1 or 2, characterized 
35 in that it corresponds to the combination of the proteins designated 
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as SEQ ID NO 1 (VanH), SEQ ID NO 2 (VanA), SEQ ID NO 3 (VanX). 
4/ Composition of polypeptides according to Claim 2 or Claim 3, 
characterized in that the VanC protein corresponding to the sequence 
SEQ ID NO 19 replaces the VanA protein corresponding to the sequence 
SEQ ID NO 2. 

5/ Composition of polypeptides according to any one of the Claims 
1 to 4, characterized in that the amino acid sequences necessary for 
the expression of resistance to antibiotics of the glycopeptide family, 
in particular to vancomycin and/or teicoplanin are under the control 
of regulatory elements, in particular proteins corresponding to the 
sequences designated as SEQ ID NO 4 (VanR) or SEQ ID NO 5 (VanS) in 
the list of the sequences. 

6/ Composition according to any one of the Claims 1 to 5, characterized 
in that it is encoded in one of the sequences SEQ ID NO 6, SEQ ID NO 
15 11, SEQ ID NO 22 identified in the list of the sequences. 

7/ Purified protein characterized in that it corresponds to the 
sequence SEQ ID NO 2 (VanA) or to the sequence SEQ ID NO 19 (VanC), 
contained in the composition according to any one of the Claims 1 to 
3. 

20 8/ Protein characterized in that it corresponds to one of the sequences 

identified as SEQ ID NO 1 (VanH), SEQ ID NO 3 (VanX), SEQ ID NO 4 
(VanR), SEQ ID NO 5 (VanS), 

9/ Nucleotide sequence characterized in that it codes for an amino 
acid sequence according to any one of the Claims 1 to 8, or in that 

25 it is a complementary DNA sequence or a corresponding RNA sequence, 

10/ Nucleotide sequence of about 7.3 kb, corresponding to the Hind lll- 
EcoRI restriction fragment as obtained from the plasmid pIP816 
comprising this Hindlll-EcoRI fragment or any part of this fragment, 
in particular the 3.4 kb EcoRI-Xbal fragment, the EcoR V -SacI I fragment 

30 of about 1.7 kb and the 3.3 kb Hindlll-EcoRI fragment. 

11/ Nucleotide sequence according to Claim 10, characterized in that 
it contains the following restriction sites as obtained from the plasmid 
pIP816 in the order: 

Hindlll, Bglll, BglH. EcoRI, BamHI, Xbal, EcoRI 
35 12/ Nucleotide sequence according to any one of the Claims 8 to 10, 
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characterized in that it corresponds to one of the sequences identified 
as SEQ ID NO 6 , SEQ ID NO 7 or SEQ ID NO 22, or in that it includes 
one of these sequences or any part of one of these sequences or also 
any sequence or part of a sequence of complementary DNA, or any RNA 
sequence corresponding to one of these DNAs, capable of 

- either constituting a hybridization probe or primer for the detection 
of resistance to antibiotics of the glycopeptide family, in 
particular to vancomycin and/or teicoplanin in particular in strains 
of the family of the Gram-positive cocci, 

- or of coding for a sequence necessary for the expression or 
regulation of resistance to antibiotics of the glycopeptide family, 
in particular to vancomycin and/or teicoplanin in particular in 
strains of the family of the Gram-positive cocci, 

13/ Nucleotide sequence according to Claim 12, characterized in that 
it includes or in that it corresponds to one of the following sequences: 
VI : GGX GAA GAT GGX TCX TTX CAA GGX 
G C AG C G 
A 

V2 : AAT ACX ATX OCX GGX TTT AC 
C T C 
C 

14/ Nucleotide sequence according to any one of the Claims 10 to 12, 
characterized in that it is one of the sequences SEQ ID NO 8 (vanA), 
SEQ ID NO 9 (vanH), SEQ ID NO 10 (vanX), SEQ ID NO 21 (vanC), SEQ ID 
NO 12 (transposase), SEQ ID NO 13 (resolvase), SEQ ID NO 14 (vanY), 
SEQ ID NO 15 (vanZ), SEQ ID NO 23 (vanR), SEQ ID NO 24 (vanS) or any 
variant of one of these sequences provided that it codes for a protein 
having immunological and/or functional properties similar to those 
of the proteins encoded in the sequences SEQ ID NO 8 (vanA), SEQ ID 
NO 9 (vanH), SEQ ID NO 10 (vanX), SEQ ID NO 21 (vanC), SEQ ID NO 12 
(transposase), SEQ ID NO 13 (resolvase), SEQ ID NO 14 (vanY), SEQ ID 
NO 15 (vanZ), SEQ ID NO 23 (vanR), SEQ ID NO 24 (vanS), or provided 
that they make possible the detection of strains resistant to 
antibiotics of the glycopeptide family. 
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15 / Nucleotide sequence according to any one of the Claims 9 to 12, 
characterized in that it corresponds to the sequence SEQ ID NO 6 or 
to the sequence SEQ ID NO 22 or in that it includes this sequence, 

16 / Recombinant sequence, characterized in that it includes a sequence 
5 of nucleotides according to any one of the Claims 9 to 14 under the 

control of regulatory elements capable of contributing to the expression 
of resistance to antibiotics of the glycopeptide family, in particular 
to vancomycin or teicoplaiiin in a specific host. 

17 / Recombinant vector, characterized in that it includes a nucleotide 
10 sequence according to any one of the Claims 9 to 16, at a site 

inessential for its replication under the control of regulatory elements 
capable of contributing to the expression of resistance to antibiotics 
of the glycopeptide family, in particular to vancomycin or teicoplanin f 
in a specific host, 

15 18/ Recombinant vector according to Claim 17, characterized in that 

it is the plasmid pAT214. 

19/ Recombinant cell host, characterized in that it includes a 
nucleotide sequence according to any one of the Claims 9 to 16 or a 
vector according to Claim 17 or Claim 18 under conditions leading to 

20 the expression of resistance to antibiotics of the glycopeptide family, 

in particular to vancomycin or teicoplanin t this host being for example 
selected from the bacteria, in particular from the Gram-positive cocci. 
20/ Nucleotide probe, characterized in that it is a DNA or a RNA and 
in that it is capable of hybridizing with a sequence according to any 

25 one of the Claims 9 to 15, this probe being if necessary labelled, 

for example it is one of the nucleotides: 
VI : GGX GAA GAT GGX TCX TTX CAA GGX 
G C AG C G 
A 

30 

V2 : AAT ACX ATX CCX GGX TTT AC 
C T C 
C 

21/ Nucleotide probe according to Claim 19, characterized in that 
35 it is specific for the sequences in Gram-positive bacteria encoding 
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a protein for resistance to glycopeptides, in particular to vancomycin 
and/or teicoplaninand is universal among these sequences. 
22/ Nucleotide probe according to Claim 20, characterized in that 
it is specific for a nucleotide sequence coding for a protein neccessary 
5 for the expression of high-level resistance to antibiotics of the 

glycopeptide family, in particular to vancomycin and teicoplanin in 
Gram-positive bacteria, 

23/ Nucleotide probe according to Claim 20, characterized in that 
it is specific for a nucleotide sequence coding for a protein necessary 
10 for the expression of low-level resistance to antibiotics of the 

glycopeptide family, in particular to vancomycin in Gram-positive 
bacteria* 

24/ Nucleotide probe according to any one of the Claims 20 to 23, 
characterized in that it hybridizes with a non-chromosomal nucleotide 
15 sequence of a strain resistant to glycopeptides, in particular to 

vancomycin and/or teicoplanin in particular that it hybridizes with 
a non-chromosomal nucleotide sequence of a strain of Gram-positive 
cocci, for example a strain of enterococci and preferably E. faecium 
4147. 

20 25/ Polyclonal or monoclonal antibodies, characterized in that they 

recognize the composition according to any one of the Claims 1 to 6 
or an amino acid sequence according to any one of the Claims 7 or 8. 
26/ Kit for the in vitro diagnosis in a biological sample of the 
presence of strains resistant to the glycopeptides, in particular to 

25 vancomycin and/or teicoplanin these strains belonging in particular 

to the Gram-positive cocci, in particular in that they are strains 
of enterococci, for example E. faecium , characterized in that it 
contains : 

- antibodies according to Claim 25, labelled if necessary, 

30 - a reagent for the detection of an immunological reaction of the 

the antigen-antibody type, 

- where appropriate, reagents to effect the lysis of the cells 
in the sample to be tested. 

27/ Kit for the in vitro diagnosis of the presence of strains resistant 
35 to the glycopeptides, in particular resistant to vancomycin and/or 
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to teicoplanin these strains belonging in particular to the 
Gram-positive cocci, in particular in that they are strains of entero- 
cocci, for example E. faecium , characterized in that it contains: 

- a nucleotide probe according to any one of the Claims 20 to 24, 
5 and if necessary, 

- oligonucleoside triphosphates dATP, dCTP, dTTP, dGTP, 

- an agent for the polymerization of DNA. 

28/ Procedure for the in vitro detection of the presence of strains 
resistant to the glycopeptides, in particular to vancomycin and/or 
10 teicoplanin these strains belonging in particular to the family of 

the Gram-positive cocci, in particular in that they are strains of 
enterococci, for example £♦ faecium or E. gallinarunu characterized 
in that it comprises: 

a) the placing of a biological sample likely to contain the resistant 
15 strains in contact with a primer constituted by a nucleotide 

sequence according to any one of the Claims 20 to 24, which is 
capable of hybridizing with the nucleotide sequence under 
investigation, necessary for the expression of resistance, this 
sequence being used as matrix in the presence of the 4 different 
20 nucleoside triphosphates and a polymerization agent under 

hybridization conditions such that for each nucleotide sequence 
which has hybridized with a primer, an elongation product of 
each primer is synthesized which is complementary to the matrix, 

b) the separation of the matrix from the elongation product obtained, 
25 this latter being then also able to serve as matrix, 

c) the repetition of step a) so as to produce a detectable amount 
of the nucleotide sequences under investigation, 

d) the detection of the amplification product of the nucleotide 
sequences, 

30 



35 
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ABSTRACT 

Polypeptides implicated in the expression of resist ance to 
glycopeptides. in particular in Gram-positive bacteria. Nucleotide 
sequence coding for these polypeptides and use for diagnosis 

The invention relates to a composition of polypeptides, 
characterized in that it contains at least one protein or part of a 
protein selected from the sequences of amino acids identified in the 
list of the sequences as SEQ ID NO 1 (VanH), SEQ ID NO 2 (VanA), SEQ 
ID NO 3 (VanX) or SEQ ID NO 19 (VanC), or any protein or part of a 
protein recognized by the antibodies directed against VanH, VanA, VanX 
or VanC, or any protein or part of a protein encoded in a sequence 
hybridizing with one of the nucleotide sequences identified in the 
list of the sequences as SEQ ID NO 8, SEQ ID NO 9 or SEQ ID NO 10 or 
with one of .the following sequences VI or V2 under stringent or only 
slightly stringent conditions: 
VI : GGX GAA GAT GGX TCX TTX CAA GGX 
G C AG C G 
A 

V2 : AAT ACX ATX CCX GGX TTT AC 
C T C 
C 

The invention also relates to the nucleotide sequences coding for 
these polypeptides as well as their utilization for the diagnosis of 
resistance to the glycopeptides* 
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UH- OC_)CD<XCDCDUUJ— <TUH-LDI— 
I — <dH-<XCDCD| — CJCDI — <X<ICJ1 — H- <T 

i — <x i — h- <r o h- cj <i i — CDdi — H-<r<r 

^h-J— h-CDCJ<XCJ<X<Il— <I<XCDCDCD 
H-<TV— CDH-<T<Xi— <rcjh-cj<x<n<ri— 
CJH-<X<lH-CDH-<r<Ih-<IH-CD<t<rH- 
h- CDCJ<Xh- (— CJClJ— <T H- CJ <I <X CD H- 

F-<ru<iu<rcDc:<rh-cjh-uri5CD<i 

UCJCCCCCJl— <IH-CJCDH-CD(LJ<I<IH- 
UOUH-OCDCJCJlJ<IOH-<ECDCJCJ 
<ICDh- CDf— H-<I1— H-CJ<rH-<I<r<IH- 
1 — <T <X CD 1 — h— <T1 — 1 — i — I — cj *— <i t— <r 
<I CJ CJ CD 1 — <L* CD ! — 1 — 1 — CD^TCJKll — CD 

<rcDCD<rj— i— <cr<^r<:r<if— cdcth-i— <r 
<E<icDCJ<r<r<iH-cucJCD<itD<r<r<r 

i — 1 — i — H- 1 — I— CD CJ CD i — <I <X CJ I — CD CJ 
1 — <X<XCJCD<XCJ<XCDl — CDI — 1 — <ICJ<E 

dCD<i:<rh-i— u<r<ri— <r<xt— h-<tcj 

1 — | — CD CJ CD <I f~ CJ 1 — 1 — CJ CD ! — CD ^ 

cDCDH-f— <rcDi— i— h-<r<r>— <r<r<ri— 
CJ<KTi — <ecj<t<ih-<e<iljcjh-h-cd 

CJ i — CD<I<E! — CD! — CJCJ CJ 1 — I — <I I — 
cXCD<TI— OH-CD!— h-CDi— CD CD CD CD H- 
cXH-l — <ICJH-CDCJ<ICJ<XCJI — CD J— CD 
H-CDCDCDCJCDCJCDCJf— t— CDCDI— CDCJ 

cj<r<rcji— h-cjcji— cj <x <i o <x<i i— 
<r <i cj t— <r cj <r i — > — *— i — cd <i <i cj 

f— CJCD*— CDI— <£!— <TCJ<r<-Tt— CD<E<I 

h-i— i— <ii— <rucjuoh-«o:<rcD<ru 

i— i— <rcj<r<icjcjr_i<rh-K-}— cxtj— 
dh-cjf— <rcjcDi— r- H-cri— <rt— cj<i 

CD CJ 1 — CD<TCJCDCJ<ICJ<ILDI — <I ■ — <T 
<I1 — i — 1 — CJ <I >— CJ h— i — CJ <I 1— CD ' — CD 
CD) — <T<T1 — C_*i<TCJCDCJ! — i — CD<ECJCJ 

<r<ru < ^rh-<rH-, u -<ii— h-<xcdcd<i^ 
h-ucd<ii— <xcdi— cjh-cdcjcjcjcecj 
!— cjt— t— cji— f— ut_i<r<ii— ^ <icjvC 
cu<icri— H-cjcDf— cj<ri— <i<r<r cd<i 

h- <X CJ »— CD CD!— i— cD<I<r<I^--tD>— 
l— cji— }— <rcDi— i— cd<ii— <i<i*— cjcj 

■dCJCICDcrt— <ri"£3CD<Ii— I— CDCDCD^T 

cjcj<icdcd<ii— cd<i!— f— <lcd<icjcd 

I — i— <T<XCJCD<ZCJI— CD<rCDr_}CJl— <E 

ui— <c<rt— cDcDcrcDJ— <ri— cdcd<tu 
cDCJCDCDh-<rh-<x<icriucjr-<rcD<i 

1 — CD <T I — CJ CD I — <L<LLD\— <Z\ — CJCD! — 
•J— i — ! — <I <I CJ h- I — CD CJ h- CD J— i — <E i— 
h- I— h- CD CJ CD *— CJ >- CJ h- <X\— »— <C CD 
H-UCJI—I--<3:CJCJCDI— H- I— <XH-<I*- 

ii — CJl — I — I — <E CD I — <E ' — <E CD I — <T CJ *£C" 

uuh <r<rui— <r <r cd i— <r<r<r<Et— 

.H-CDh-<I^I— <I:<I<r^—CD^-CDCJ<I^- 
|— I— |— K-CD<ICD<I<ICJ»— CDI— <lCJt— 
1— h-UCJCDCDCJcri— cjh-cdcdcd^i 
i — <C i — CDK-H-CDCDCDCJl— <T CD <I<X CJ 

cjcjcD<rcD<i!— <tcd!— t— <r<r<rcD^- 
cdcji— «Kr<rucjf-cDK-<rcDCJCD<r 
<r cd f- <x <r i— h- <r h- h- <r u h- h- <r o 
<r<rt— u<r<r<r<x>— cju«n:<rH-cj<i 



CD CJ CD CD 1 — ! — i — <ICJ 

H-<r-o:cr<r<rcjH-<r 

CJ<I <Eh- I — H-i — LD<T 
CDt — CDi — CJ CD CJ <I CJ 
LDi — <I<XCI1 — LJI — CD 
f— CD<r<Th-CJCDh-CD 
<Xh- h-CDH-l— CUfJ 

ucdcj>— <iH-cjf— <r 
<dCDcD<i<i<r<ri— <i 
h-h- tzKKECJoD- 
icr cj i — cd cd cj i — <r i— 

CDCDi — CDI — CD H- <E <E 

<z cd <x <l cd <r <r cd h- 
Ci — <tcji — i — i — 
<i <r cd cd i — i — <i<r <r 
<i«crj— ui— <r<iv-<i 
t-H-i-<i<r<r<ii- <r 
uco<ru€H<ru(- 

<I1 — CDI — <T<T'r— I — <I 
CJ CJ CD <L U <E <T <I CD 
•r— <TCD<I1— CDi— CJl— 

<ih-<r<r<rf-uH-u 
<z\ — <r <x <i cd <r <r 
<r<tcD<i<r<i<rcj^ 

l— t— CD<ICDCDCD<IH- 

hi- uuh <r cd <i <r 
i— c_D<ri — cjcj ozcdi- 
cd h- <x <r <r h- <i<xcj 

<c h- CJ H- CD <I<I '-T CJ 

c<c<rcDt— 1— <kicj 
cj<icdo<o:i— CD^ICD 
cd u <x <i i— <r<r<icD 

CDCJCDCD<E<r<r<IU 

<icd*— <r<z<rcD<r5— 
c:<x<r<i<ncD<r^!— 
<i CJ CD <I CD CD <i <i<r 
<I<XCDI— CD<X<HD<T 

<rt— cji— <rcji— <l 

H-<Eh-<r<r<Ti— H-i— 
Cl CJ 1— CD CD !— H- i— <I 

cd cj <r cj ci <i i— c j i— 

CD<TH-CDCEH-<lOCJ 
U<XCJCDl— <It— CDCD 
<IF— I— <L<LU<Z<Ll- 

{— <r <r <i i— i- <z <i<r 
<r cd cd <r <r }— u <i<r 
cj<£<x>-H-cJcD<ci<a: 
<x <r CD J— CJ CD CD cd ^— 
<I CD CJ 1 — CD CD C£t '_J CD 

h- <r cd cd <r cd i— u h- 
<rcj<r<rci<c<rri}<i 

CD I — CD<T<II— <I'~S — 
CJ <T <T CE x CJ 1— '<— 
i— I — i — CDXh-CJCD<E 
<Ti — I — CDCD<ICDCJ!— 

<ii— <io:cdj— ucdj— 
H-<r<r<rcjcDcj<iH- 



Fig. h ( 1/5) 
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CI j— CJ CJ <L <L J— CD CD CD i- <C h- h- <T J— CD J— <E CJ <X >- <X CJ <X 

^<i<ruUHH<rLD<rHcriD<ruucDcD<iuu$uacD 

CD >- CD <I <X CD CD <X CD <T <I <I CD CD 1- U V- <X CD <I <T CD 

<x<r^<rH-OD<r<i<rcDU<icD>-f-<r<rcDCD«xcDCJ$»-cj 

C j^h-ut-u<rcD<r<r<rj->--i— <XH-h-cDh-<xcDCD-^^-cJ 

cca:<rcDCD)— iucJuu<rv-<iH-cD«n:i— <^<rcj<r<iH-<i 

<r <r i— <r cd cj <r <r <e -h- <r <x cd cd >— <r <r <e cd <l cj cj cr o cd 
cDScr<rrj3CDfiDcr<rH-(Zi<r!~cD<r^<r<rcD^-cD5<rocj 

i- 5 cj cj <r i- u cd <r h- u <r i- cd <r h- i- <r <r v- h- <r cj <r cd 
cxto^ui-<ri-cr<r<rc:<icD<r<E<rcDCj<r<rcD^cjcD 

CDl— CDCD<E<r<TUCDl— <TCD<ri— CDOCDCDCD*— CJCJ^J-CJ 
h - h- 1- CD I- <I CD h- I- <E CD I- U h- 'r- H- <I CJ <I <T <I H- H- U CJ 
^ I- h- CD h- h- <X <X CD <T »- h- CD U <X CD CD CD CD CD CD/J U UH- 

CD CD CJ <Z <T I- CJ h- 1- CD <I <E 1- <L CD J- U I- <X CJ <L <I CD H- LD 
CDrJDU<X<lrjDCDh-UU<I<E^-<XO^<roO<rUrDCJ<Ih- 

H-cj<i>-<icr<ncD^-<r<r<rcD<rcD<rcDH-cjcjcjcj<rv-cD 

CD <T CD <I <T CD CD 1— CJ CD <X <£ CJ CD U CD 1- H- CD CD h- H- <I j- CJ 
I- i— h- <r h- CD CD CD CJ CJ <L h- J— CD <I CJ H- H- <E CJ CJ <I H- H- CJ 

<TH-<r<lCDCJ<lCD<rCDCDI-l— <rCDCDUCD<rCDLDCDLD^CD 

»— h- cd <i <r <r <i i— <t cd u <r cd »— »— <i <r <r h- <r u y <r h- 

K-<E<lC:H-UI-<rCDCDCDCD!— CJCJCDf-CDCDV-CDCD^CDCJ 

U'icorjfJucouui-uocDuoi-<ri-cDacj<r5i-H 

<E CD I- H- CD U <I CJ CD <T CD h- H- H- <T <I <T f- J- <X <X <T <I <I <I 
1 — <£ CJ CD <T ID CD CJ 1 — 1—1 — H- <T <I CJ 1 — <X <T CJ 1 — CD <I LJ CD CD 
«cir^H-l— CDUCD<TCJ<rV-CDCJCDI--H-<I<r^-CDl— cj <i cj cd 
CUOU<lHHU<rCDHaCH<ri-U<lUHH<IH(J<I 

<r <i<i>- o ^cd<ei-<i<e cDi— <c<rr<rcDo<x<racDO<r cd 
<x<i<icr<n — <z<x<icDCJCDH-<x<ii—cj<rcDi — cd cd cj cd cj 

I — <£ CJ I — ! — CJCJi — <T1 — CDCDi — CJI — 1 — 1 — <TCD1 — 1 I — 5£*~H 

<x<r<Tcj<ruH-H-cj<rcDCDCjf— i— »— }— <xcjcdcjv— <x<3:u 

CD CI CD <T CD CJ h- CD H- CJ U I— <t • 1— CJ CJ <I >- <T H- CJ CD J— CD CJ 

cj<i<xj— <rcD<ri— f-i— cj<r<rcD<iH"<r<r<ri— cDcj<r<rcj 

cd j— i— i— <l <r <r <x j— h- h- ?— <r cj cd <x <r cd <r cd <i <x h- cj cd 
<ii— <Lu<r<r<i<rH-^'a:cD<x>-H-h-<a:<x<ru!^ocDcDCj 

CDCJ»— CJCDCD<IH-CD<IJ— CDCDl— <X<rCJ<rf— <E>— CD <I 
h-<XCDl_DCDCJ^h-<Ir-<i:CJCJ<Il— CDH- CD»— CD^-<TCJCDCJ 

<r i— h- U3 cd <r <r <r cd cj cj <r <x i— <r <x i— cj <r i— v- t— <r cd h- 
i — <r<r cjcj<xcJcjcJcDCDd^<r<ri — i — <x cj <x <x<x cj i — h- 
cjj— <xcDUh-<rh-cji— H-<rcDH-<x<ru<r<x<i!— cj<xcj<t 

<I h- rjD<i<X<rf-CD<TUCD<r<lCDCJH-CDCDCJ<XH-CDf— CD H- 

<i<rcD<x cd h-<if— u cd cj <r i— K<r<n-<r$<r cd i— 
»—<ih-cjh-<icD^-u<iui-o»-<i:<i'ircj«3:<rcDcD<r<rcD 

<ICD1— <IfJl— CJCJCJCD'CL<X<ICD <L -XL.*>CJl — 0<^0>—CD>—CD 

CDh-^ICJCDl — CJCJI — (JCJ1 — <Tl — <I<TI — 1 — 1 — ^D^T^E^^y 
J— H-UCDt-<ICDCDCJUH-<I < XCDCD<XCD<XCJC:<Il--CDl— LD 
CDCDCDr£)crCDUCD<r>— <I>-CJCJCDCDCJJ— CD<EH-CDH-CJCD 

cijf£j<rr_jL.ii — cdcji — cjfJ<rcji — i — cdcdi — <r<xf— <ri < ^- i ~\ 
cdi — — <xi — cji — cdcj<ii — i — < — cjcjcj < xcj<r<r| 

<T H- CD U <I CD H- H- CJ <I <L <L H- <t !- <C H- <C <I CJ <t H- <T <I 
ci}— <Ti — i — CDCDCJCJl — CJ CD ^1 1 — f — I — CD I — <T<LCJCJ5 1 — 5? 

<r cr <r »- cj <r <r cj j- <r u h- cd <r <r h- cd cj cd h- cd j- cj ^ 
cntD<ru<rLDUui-ci!<ruuHHu<ri-<ru<rcDCoya 

H-uucDCOuuu<rcDUw<r^-cDuH<r<rcjcj<r<rcu 
cd cj cj i- u cd u v- h- i- <r <r cd <x <r h- cj cd cd cd <x <e cd cd cd 

H-CDCD<IUCDCDCDh-CD<r<ICJ<I<rv-l— H-CDCDH-<rCJ<rH: 

crhUui-rjcoucoi-Uf-<ruH<rHi-Hi-H-<r<ii-<r 
h- h_ a *- <r h- <i h- <x <r cd <r <l r jd h- h- <x <r <r cj cj u cd <r u 
i— <x<r <r i— <r <i<r cj <l <r cd <i <r a <r )— u i— cd u i— cj cj <l 
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UgS8S!^tSf£g5S!!tg8§S85^t 

<T t- CD <T <I CD H- J- <L CJ <X CD <I CD CD H- H- CD h- CJ <I <X LD LJ 
CDU^SSS5Sf-CD<r»-C:^-fDCDCDH-^^C£CDgCDU 

H H- <E CD 3 CD CD <X <£ CD CD J- CJ h- CJ <X <I <X 1- H- U CD <E <E <L 
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t £ £ S S <r S <r f- <i <: cj <r <r cd cj h- h- c r- 

n t- <t <r f D <CL CD CD CD \— h- <X H- CD CD p «J3 <^ : 

^cdScdu^^^^^^^^^^F^S^!" 

^KSuCD^CDU^CDcI^crh-^^H-ggg 

g§o5?£oSSSu>-g>-2£CDH-g£ 
?S ri ?i <r r rn »- f i <t t_3 J- <E <T <L <L <L <~ H- U CD -3 



CDCJCJ<rCDH-U<IUt^5S2£^^fjtcD 

cDUuSSSSSScD^^jr^gcDHCD 



^u3wucDH-^cD<r<r<ip5CJCj^T<r^^;r: 
^rSr-f—S<rcjcDcjcji— i— <r i — <r <r i — co <z .-— 

S cd £ cd 5 cd h- h 5 »- 1- cj h- <x <r >- <£ c <r <r 
rDCDSS§cj^uS5PcD<ruv-ucD<rv-<xcD 

cdcdS^^cdSh-cjucdcdcjcdhk^cducdcd 

lZ t: fi t± S t- CJ <T CD h- h- <t CD I- 1- CD I- <I <I <£ <t 
SSBfeKOCJCJCDpHCJCDCJ^^^gCJgCJ 

<T <X £J H- <T CD CJ H- CD H- <T H- 1- h- <X <X C p W CD <I 
§CJoSScJ^^CD<XH-UCD£CD£UCJC<r<I 

? <r £r CD u <T CJ I— H- h- <r CL J— <t CD <t I- H- »- 
^KO^ol^SttSSRtSttcDcB 




Fig. 4 (5/5) 
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LysLeuPhePheLeuLeuIleCys** *ArgPheThrAsnArcIleLys * * *LeuLeuPhe 
SerPheSerPheCysSerPheValArgAspLeuLeuThrValleuAsnSerPhePheSer 
AlaPheLeuPheAlaHisLeuLeuGluIleTyr***ProTyr***IleAlaSerPheGin 
AAGCTTTTCTTTTTGCTCATTTGTTAGAGATTTACTAACCGTATTAAATAGCTTCTTTTC 

SerHisCysProCysPheProHisHisSerPheLysCysSerAspSerArgGlnTyrAsn 
AlalleAlaLeuAlaSerHisThrlleLeuSerSerValVallleAIaGlySerllelle 
ProLeuProLeuLeuP roThrPr oPhePheGlnVa 1********* GlnAla Val * * * Phe 
AGCCATTGCCCTTGCTTCCCACACCATTCTTTCAAGTGTAGTGATAGCAGGCAGTATAAT 

100 

PheVaIPheSer***LysIleTyrAlaPheMetGln***MetAsnGlyIleThrIlePhe 
LeuPhePheLeuArgLysSerMetHisSerCysSerArg**«MetAlaSerProPheSer 
CysPhePheLeuGluAsnLeuCysIleHisAlaValAspGluTrpHisHisHisPhePro 
T7TGTTTTTTCTTAGAAAATCTATGCATTCATGCAG7AGATGAATGGCATCACCATTTTC 

GlnSer***LeuMetLysValLeuLysCysHisSerIlePheThrGlnGlyLysSerTyr 
LysAlaAsn******ArgTyrLeuAsnValIleArgTyrSerLeuArgValLysValThr 
LysLeuI2eAspGluGlyThr***MetSerPheAspIleHisSerGly***LysLeuGln 
CAAAGCTAATTGATGAAGGTACTTAAATGTCATTCGATATTCACTCAGGGTAAAAGTTAC 

200 ... 
LysValValPheThrSerAsnPhePheGlnMetlleProLysCysIlePheProLeuArg 
LysSerTyrSerLeuArgIleSerPheLys***SerGlnSerValPheSerLeu***Gly 
SerArglleHisPheGluPheLeuSerAsnAspProLysValTyrPheProPheGluAsp 
AAAGTCGTATTCACTTCGAATTTCTTTCAAATGATCCCAAAGTGTATTTTCCCTTTGAGG 

300 



Fig. 5 ( 1/25) 



ET AL (703) 521-5940 



11/69 

IleMetlleLysArgGlyTrpThrAsnThrAsnLeuPheArcryrlleLeuTyrAspArg 
******SerSerGluAspGlyLeuThrProIleCysPheA£rIleTyrCysMetThrGlu 
AsnAspGlnAlaArgMetAsp***HisGlnSerValSerIleTyrIleVal***ProAsn 
ATAATGATCAAGCGAGGATGGACTAACACCAATCTGTTTCGArATATATTGTATGACCGA 
. • • • • 

IleTrpAspAlaPheAspMetSerValTrpProThrGlyIle?roLysAsnSer***Leu 
SerGlyMetLeuLeuI le * * *ValTyrGlyGlnProGlyTyrArgArgThrAlaAsn* * * 
LeuGlyCysPhe***TyrGluCysMetAlaAsnArgAspTzrGluGluGlnLeuIleGlu 
ATCTGGGATGCTTTTGATATGAGTGTATGGCCAACCGGGATACCGAAGAACAGCTAATTG 

400 

AsnSerLysSer***ThrValPhePheProProSerLeuIleAsnTyrFhe***IlePro 
ThrAlaAsnProLysArgPheSerScrLeuLeuArcLeuLe-7hrIleSerLysSerArg 
GlnGlnIleLeuAsnGlyPheLeuProSer?heAIaTyr** w LeuPheLeuAsnProVai 
AACAGCAAATCCTAAACGGTTTTCTTCCCTCCTTCGCTTATTAACTA7TTCTAAATCCCG 
• • • * • • 

PheGlyLysSerGluValGlyProGlnTyrProPhellePheArgAspLeuHisLysSer 
LeuGluLysValLys***ValProSerIleHisSerSerSerGlyIleCysIleLysAla 
TrpLysLys***SerArgSerProValSerI2eHisLeuGlnGlyPheAla***LysPro 
TTTGGAAAAAGTGAAGTAGGTCCCCAGTATCCATTCATCTTCAGGGATTTGCATAAAAGC 

500 ... * 

LeuSerLeuPheArgCysLysGlnPheSerThrSerArgAsnrieHisSerValSerPhe 
CysLeuCysSerGlyValSerAsnSerLeuProLeuAlallePhelleGlnTyrHisSer 
ValSerValProVal***AlaIleLeuTyrLeuSerGlnFteSerPheSerIleIlePro 
CTGTCTCTGTTCCGGTGTAAGCAATTCTCTACCTCTCGCAATTTTCATTCAGTATCATTC 

600 



Fig. 5 (2/25) 
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HisPheCysIlePheAsnLeuLeuVaiGlnLeuTyrlleAsnArgValTyrSerlleAsp 
IleSerValPheSerIleTyr***PheAsnTyrIleSerIleGluCysThrLeuLeuIle 
?heLeuTyrPheGlnPheIleSerSerIleIleTyrGln***SerValLeuTyr***Tyr 
CATTTCTGTATTTTCAATTTATTAGTTCAA7TATATATCAATAGAGTGTACTCTATTGAT 

• • • * * • 
ThrAsnValValAsp* * * * * *AsnHisSer * * *GluArgLeuIleArgLeuValSerLy s 

GlnMet******ThrAspLysIleIleValLysSerValSer***AspLeuSerGlnLys 
LysCysSerArgLeuIleLysSer ***LeuArgAlaSerfiisLysThrCysLeuLysAsn 
ACAAATGTAGTAGACTGATAAAATCATAGTTAAGAGCGTCTCATAAGACTTGTCTCAAAA 

700 

MetArg***TyrPheAlaGluAsnArgLeu7yrSerCysGlr*PheAsp***ProGluSer 
***GlyAspIleLeuArgLysIleGlyTyrIleArgValSerSerThrAsnGlnAsnPro 
GluValllePhcCysGlyLysSerValllePheValSerValArgLeuThrArglleLeu 
ATGAGG7GATATTTTGCGGAAAATCGGTTA7ATTCG7GTCAG7TCGACTAACCAGAATCC 

• • • ■ • • 
PheLys7hrIleSerAlaValGluArgAspArgAsnGly7yr7yrIleLysArgLysPhe 

SerArgGlnPheGlnGlnLeuAsnGluIleGiyMetAspIleIle***ArgGluSerPhe 
GlnAspAsnPheSerSer ltr **ThrArgSerGlu7rpIleLeuTyrLysGluLysValSer 
77CAAGACAA777CAGCAG7TGAACGAGA7CGGAA7GGA7A77A7A7AAAGAGAAAG7T7 

800 . 

GlnGluGlnGlnArgIleAlaSerAsnPheLysLysCys***7hrIle7yrArgLysMet 
ArgSerAsnLysGlySerArgAlaThrSerLysSerValArcArgPhe7hrGlyArg*** 
GlyAla7hrLysAspArgGluGlnLeuGlnLysValLeuAspAspLeuGlnGluAspAsp 
CAGGAGCAACAAAGGA7CGCGAGCAAC77CAAAAAG7G77AGACGA777ACAGGAAGA7G 

900 
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7hrSerPheMetLeuGlnThr***Leu61uSerLeuValValHisLysIleTyrLeuAsn 
HisHisLeuCysTyrArgLeuAsnSerAsnHisSer * **TyrThrArgSerIle*** He 
IlelleTyrValThrAspLeuThrArglleThrArgSerTtrGlnAspLeuPheGluLeu 
ACATCATTTATGTTACAGACTTAACTCGAATCACTCGTAGTACACAAGATCTATTTGAAT 

***SerIleThrTyrGluIleLysArgGlnVaI***AsnHis***LysIleHisGlyLeu 
AsnArg * * *HisThrArg* * *LysGlyLy sPheLys IleThrLysArgTyrMet Ala w * * 
IleAspAsnlleArgAspLysLysAlaSerLeuLysSerLeuLysAspThrTrpLeuAsp 
7AA7CGA7AACA7ACGAGA7AAAAAGGCAAG777AAAA7CAC7AAAAGA7ACA7GGC77G 

1000 

IleTyr61nLysIleIleHisThrAlaAsnSer***LeuLeu*«*TrpLeuValLeuThr 
PheIleArgArg***SerIleGlnProIleLeuAsnTyrCysAsnGlyTrpCys"*Pro 
LeuSerGluAspAsnProTyrSerGlnPheLeuIleThrValMetAlaGlyValAsnGln 
ATTTATCAGAAGATAATCCATACAGCCAATTC7TAArTACTG7AATGGCTGGTGTTAACC 
• • * • • . 

Asn * * *SerGluIleLeuPheGly * * * AspAsnVa !LysGlyLeuAsn7rpLeuArgLys 
IleArgAlaArgSer7yrSerAspGlu7hr7hr * * * ArgAsp* * * I leGly * * *GluArg 
LeuGluArgAspLeuIleArgMetArgGlnArgGluGlylleGluLeuAlaLysLysGlu 
AA77AGAGCGAGATC77A77CGGA7GAGACAACG7GAAGGGA77GAA77GGC7AAGAAAG 
1100 . . . . 

LysGluSerLeuLysValAsp***ArgSerIleIleLysIle7hrGlnGlu***IleMet 
ArgLysVal***ArgSerIleLysGluValSer***LysSerArgArgAsnGluLeuCys 
GlyLysPheLysGlyArgLeuLysLys7yrHisLysAsnHisAlaGlyMetAsn7yrAla 
AAGGAAAG777AAAGG7CGA77AAAGAAG7A7CA7AAAAA7CACGCAGGAA7GAA77A7G 

1200 
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ArgArgLysLeuTyrLysGluGlyAsnMetThrValAsnGlr.JleCysGluIleThrAsn 
GlyGluSerTyrIleLysLysGluIle***Leu"**IleLys?heValLysLeuLeuMet 
AlaLysAlaIle***ArgArgLys7yrAspCysLysSerAsnLeu*«*Asn7yr*"-Cys 
CGGXXAAAGC7A7A7AAAGAAGGAAA7A7GAC7G7AAA7CAAA777G7GAAA77AC7AA7 

ValSerArgAlaSerLeu7yrArgLysLeuSerGluValAsnAsn**»ProPheCysIle 
7yrLeuGlyLeuHis7yr7hrGlyAsn7yrGlnLys***IleIleSerHisSerValPhe 
Ile***GlyPheIleIleGlnGluIleIleArgSerGlu*«*LeuAlaIleLeuTyrSer 
GTATCTAGGGCTTCATTATACAGGAAATTATCAGAAGTGAATAATTAGCCATTCTGTAT7 

1300 

ProLeuMetGlyAsnIlePheLysGluGluLysGlu7hrIleLysTy'r«**GlnProPro 
Arg***7rpAlaIlePheLeuLysLysLysArgLysLeu***AsnIleAsnSerLeuLeu 
AlaAsnGlyGln7yrPhe***ArgArgLysGlyAsn7yrLysIleLeu7hrAlaSer*»» 
CCGC7AA7GGGCAA7A77T77AAAGAAGAAAAGGAAAC7A7AAAA7A77AACAGCC7CC7 

SerAspAlaGluLysProPheAspLysLysArgIleIleIleleuArgAsnSer***Ser 
AlaMetProLysSerProLeuIleLysLysGluSerSerSer***GluIleLeuSerHis 
ArgCysArgLysAlaLeu* * * * * *LysLysAsnHisHisLeuLy s Ly sPheLeuVa 1 1 le 
AGCGA7GCCGAAAAGCCC7TTGATAAAAAAAGAA7CATCA7C7TAAGAAA7TC7TAG7CA 

1400 . 

PheIleMet***MetLeuIleAsnSerAlaLeu***SerAspLysLeuLeuArgAlaAsn 
LeuLeuCysLysCysLeu***IleArgPro7yrAsnLeuIleAsnTyr***GlyGlnThr 
7yrTyrValAsnAla7yrLysPheGlyProIleIle******IleIleLysGlyLysLeu 
777A77A7G7AAA7GC77A7AAATTCGGCCC7A7AA7C7GA7AAAT7A77AAGGGCAAAC 

1500 
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LeuCysGluArgVallleThrMetSerAspLysIleLeuIleValAspAspGluHisGlu 
TyrValLysGly**** * *Leu* "AlalleLysTyrLeuLeuTrpMetMetAsnMetLys 
Met * * *Ly sGlyAspAsnTy rGluArg * * * AsnThrTyrCysGly * * * * * *Thr * * * Asn 
TTATGTGAAAGGGTGATAACTATGAGCGATAAAATACTTATTGTGGATGATGAACATGAA 
» • • • » * 

IleAlaAspLeuValGluLeuTyrLeuLysAsnGluAsnTyrThrValPheLysTyrTyr 
LeuProIleTrpLeuAsnTyrThr***LysThrArgIleIleArgPheSerAsnThrIle 
CysArgPheGly***IleIleLeuLysLysArgGluLeuTyrGlyPheGlnIleLeuTyr 
ATTGCCGATTTGGTTGAATTA7ACTTAAAAAACGAGAAT7ATACGG7TTTCAAATACTAT 

1600 

ThrAlaLysGluAlaLeuGluCysIleAspLysSerGluIleAspLeuAlalleLeuAsp 
ProProLysLysHisTrpAsnVal** *ThrSerLeuArgLeuThrLeuProTyrTrpThr 
ArgGlnArgSerIIeGIyMetTyrArgGlnVal***Asp***ProCysHisIleGlyHis 
ACCGCCAAAGAAGCA77GGAA7G7A7AGACAAG7C7GAGA77GACC77GCCA7A77GGAC 
• » • » • • 

IleMetLeuProGlyThrSerGlyLeuThrlleCysGlnLysIleArcAspLysHisThr 
SerCysPheProAlaGlnAlaAlaLeuLeuSerValLysLys***Gly7hrSerThrPro 
HisAlaSerArgHisLysArgPro7yr7yrLeuSerLysAsnLysGlyGlnAlaHisLeu 
ATCATGC77CCCGGCACAAGCGGCC77AC7A7C7G7CAAAAAA7AAGGGACAAGCACACC 

1700 .... 
7yrProIleIleMetLeu7hrGlyLysAsp7hrGluValAspLysIle7hrGlyLeuThr 
HeArgLeuSerCys***ProGlyLysIleGlnArg***IleLysLeuGlnGly***Gln 
SerAspTyrHisAlaAspArgGluArg7yrArgGlyArg***Asn7yrArgValAsnAsn 
7A7CCGA77A7CA7GC7GACCGGGAAAGA7ACAGAGG7AGA7AAAA77ACAGGG77AACA 

1800 
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IleGlyAlaAspAspTyrlleThrLysProPheArgProLeuGluLeuIleAlaArgVal 
SerAlaArgMetIleIle***ArgSerPrcPheAlaHis7rpSer*«*LeuLeuGly»*» 
ArgArgGly***LeuTyrAsnGluAlaLeuSerProThrGlyValAsnCysSerGlyLys 
ATCGGCGCGGATGATTATATAACGAAGCCCTT7CGCCCACTGGAG7TAATTGCTCGGG7A 
• 

LysAlaGlnLeuArgArgTyrLysLysPheSerGlyValLysGluGlnAsnGluAsnVal 
ArgProSerCysAlaAsp7hrLysAsnSerValGlu***ArcSerArgThrLysMetLeu 
GlyProValAlaProIleGlnLysIleGlnTrpSerLysGlyAlaGluArgLysCysTyr 
AAGGCCCAG77GCGCCGA7ACAAAAAA77CAG7GGAG7AAAGSAGCAGAACGAAAA7G77 

1900 . . 

IleValHisSerGlyLeuValIleAsnValAsn7hrHisGluCys7yrLeuAsnGluLys 
SerSer7hrProAlaLeuSerLeuMetleu7hrProMetSerValIle***7hrArgSer 
ArgProLeuArgProCysHis ** «Cys * * •HisFro* * *V*ILeuSerGluArgGluAla 
A7CG7CCAC7CCGGCC77G7CAT7AA7G77AACACCCA7GAG7377A7C7GAACGAGAAG 
* 

GlnLeuSerLeu7hrPro7hrGluPheSerIleLeuArgIleIeuCysGluAsnLysGly 
SerTyrProLeuLeuProProSerPheGln7yrCysGluSerSerValLys7hrArgGly 
ValIleProTyrSerHisArgValPheAsnThrAlaAsnProLeu***LysGlnGlyGlu 
CAGTTATCCCTTACTCCCACCGAG77T7CAATAC7GCGAATCC7C7G7GAAAACAAGGGG 
2000 .... 

AsnValValSerSerGluLeuLeuPheHisGluIle7rpGlyAspGlu7yrPheSerLys 
KetTrpLeuAlaProSerCysTyrPheMetArgTyrGlyAlaihrAsnlleSerAlaArg 
CysGly***LeuArgAlaAlaIleSer***AspMetGlyArgArgIlePheGlnGlnGlu 
AA7G7GG7TAGC7CCGAGC7GC7A777CA7GAGA7A7GGGGCGACGAA7A777CAGCAAG 

2100 



Fig. 5 (7/25) 



BLOV ET (703, 521-5940 
17/ 69 

SerAsnAsnThrlleThrValHisIleArgHisLeuArgGluLysMetAsnAspThrlle 
AlaThrThrProSerProCysIleSerGlyIleCysAlaLysLys**«ThrThrProLeu 
GlnGlnHisHisHisArgAlaTyrProAlaPheAlaArgLysAsnGluArgHisHis * * * 
AGCAACAACACCATCACCGTGCATATCCGGCATTTGCGCGAAAAAATGAACGACACCATT 

AspAsnProLysTyrIleLysThrValTrpGlyValGlyTyrLysIleGluLys***Lys 
IleIleArgAsnIle***LysArgTyrGlyGlyLeuValIleLysLeuLysAsnLysLys 
***SerGluIleTyrLysAsnGlyMetGlyGlyTrpLeu***Asn***LysIleLysLys 
GATAATCCGAAATATATAAAAACGGTATGGGGGGTTGGTTATAAAATTGAAAAATAAAAA 

2200 

LysArgLeuPheGlnThrArgThrLysThrLeuHisValTyrArgCysAsnCysCysGly 
AsnAspTyrSerLysLeuGluArgLysLeuTyrMetTyrlleValAlalleValValVal 
ThrThrIleProAsn***AsnGluAsn?he?hrCysIleSerLeuGlnLeuLeuTrp*»* 
AAACGACTATTCCAAACTAGAACGAAAACTTTACATGTATATC3TTGCAATTGTTGTGGT 

SerAsnCysIleArgValValTyrSerPheAsnAspProArgGluThrTrpGlyLeuAsp 
AlalleValPheValLeuTyrlleArgSerMetlleArgGlyLysLeuGlyAspTrpIle 
GlnLeuTyrSerCysCysIlePheValGln***SerGluGlyAsnLeuGlyIleGlySer 
AGCAATTGTATTCGTGTTGTATATTCGTTCAATGATCCGAGGGAAACTTGGGGATTGGAT 

2300 . 

LeuLysTyrPheGlyLysGlnIle***LeuLysSerProGlyArgAspGluIleIleSer 
LeuSerlleLeuGluAsnLysTyrAspLeuAsnHisLeuAspAlaMetLysLeuTyrGln 
***ValPheTrpLysThrAsnMetThr***IleThrTrpThrArg***AsnTyrIleAsn 
CTTAAGTATTTTGGAAAACAAATATGACTTAAATCACCTGGACGCGATGAAATTATATCA 

2400 
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Ile?heHisThrGluGlnTyrArgTyrLeuTyrLeuCysGlyXspCysHis«**TyrSer 
TyrSerUeArgAsnAsnlleAspIlePhelleTyrValAlalleVallleSerlleLeu 
IleProTyrGlyThrIle***IleScrLeuPheMetTrpArgLeuSerLeuValPheLeu 
ATATTCCATACGGAACAATATAGATATCTTTATTTATGTGGCGATTGTCATTAGTATTCT 

TyrSerMetSerArgHisAlaPheLysIleArgLysIleLeu***ArgAspLysTyrArg 
IleLeuCysArgValMetLeuSerLysPheAlaLysTyrPheAspGluIleAsnThrGly 
PhetyrValAlaSerCysPheGlnAsnSerGlnAsnThrLeuThrArg***IleProAla 
TAT7CTATGTCGCGTCATGCTTTCAAAATTCGCAAAATAC7T7GACGAGATAAATACCGG 

2500 

His«**CysThrTyrSerGluArgArg***ThrAsn***Ala?heCysGlyAsnGlyCys 
IleAspValLeuIleGlnAsnGluAspLysGlnlleGlule-jSerAlaGluMetAspVal 
LeuMetTyrLeuPheArgThrLysIleAsnLysLeuSerPheLeuArgLysTrpMetLeu 
CATTGATGTACTTATTCAGAACGAAGA7AAACAAATTGAGCT-TCTGCGGAAATGGATGT 

TyrGlyThrLysAlaGlnHisIleLysThrAspSerGlyLysAlaArgAlaGlyCysLys 
MetGluGlnLysLeuAsnThrLeuLysArgThrLeuGluLysArgGluGlnAspAlaLys 
TrpAsnLysSerSerThrHis***AsnGlyLeuTrpLysSerGluSerArgMetGlnSer 
TATGGAACAAAAGCTCAACACATTAAAACGGACTCTGGAAAAGCGAGAGCAGGATGCAAA 

2 600 

AlaGlyArgThrLysLy sLys * * *ArgCy sTyrValLeuGlyAlaArgTy r * * * AsnAla 
LeuAlaGluGlnArgLysAsnAspValValMetTyrLeuAlaBisAspIleLysThrPro 
TrpProAsnLysGluLysMetThrLeuLeuCysThrTrpAxgThrlleLeuLysArgPro 
GCTGGCCGAACAAAGAAAAAATGACGTTGTTATGTACTTGGCGCACGATATTAAAACGCC 

2700 
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ProTyrlleHisTyrArgLeuPheGluProAla* * * ArgGlySerArgHisAlaGlyArg 
LeuThrSerllelleGlyTyrLeuSerLeuLeuAspGluAiaProAspMetProValAsp 
LeuHisProLeuSerValIle***AlaCysLeuThrAxcLeuGlnThrCysArg***Ile 
CCTTACATCCATTATCGGTTATTTGAGCCTGCTTGACGAGGCTCCAGACATGCCGG7AGA 

SerLysGlyLysValCysAlaTyrHisValGlyGlnSerVaiSerThrArgThrAlaAsn 

GlnLysAlaLysTyrValHis I leThrLeuAspLy sAlaTyrAfgLeuGluGlnLeu He 
LysArgGlnSerMetCysIleSerArgTrpThrLysArgIleAspSerAsnSer"*Ser 
TCAAAAGGCAAAGTATGTGCATATCACGTTGGACAAAGCGTATCGACTCGAACAGCrAAT 

2800 

f 

ArgArgValPhe***AspTyrThrVal***ProThrAsnAs?AsnAlaAsnLysAsnAla 
AspGluPhePheGluIleThrArgTyrAsnLeuGlnThrlleThrLeuThrLysThrHis 
ThrSerPheLeuArcLeuHisGlyIleThrTyrLysArg'**Arg***GlnLysArgThr 
CGACGAG777T77GAGA77ACACGGTATAACC7ACAAACGA7AACGC7AACAAAAACGCA 

• ••••• 

HisArgProIleLeu7yrAlaGlyAlaAspA$pArg* * * IleLeuSerSerAlaPheArg 
HeAspLeuTyrTyrMetLeuValGlnMetThrAspGluP^e7yrProGlnLeuSerAla 
***7hr7yr7hrIleCys7rpCysArg***ProMetAsnPheIleLeuSerPheProHis 
CA7AGACC7A7AC7A7A7GC7GG7GCAGA7GACCGA7GAA7777A7CC7CAGC777CCGC 

2900 . 

ThrTrpLysThrGlyGlyTyrSerArgProArg^lySerAspArgValArgArgPro*** 
HisGlyLysGlnAlaValIleHisAlaProGluAspLeu7trValSerGlyAspProAsp 
MetGluAsnArgArgLeuPheThrProProArgIle***?roCysProAla7hrLeuIle 
ACA7GGAAAACAGGCGG77A77CACGCCCCCGAGGA7C7GACCG7G7CCGGCGACCC7GA 

3000 
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**«ThrArgGluSerLeu***GlnHisPheGluLysArgArcCysIleGln***Gly*«* 
LysLeuAlaArgValPheAsnAsnlleLeuLysAsnAlaAlcAlaTyrSerGluAspAsn 
AsnSerArgGluSerLeuThrThrPhe***LysThrProLeuHisThrValArgIleThr 
TAAACTCGCGAGAGTCTTTAACAACATTTTGAAAAACGCCGCTGCATACAGTGAGGATAA 

GlnHisHis***HisTyrArgGlyProLeuArgGlyCysGlyValAsnArgIleGlnGlu 
SerllelleAspIleThrAlaGlyLeuSerGlyAspValValSerlleGluPheLysAsn 
AlaSerLeuThrLeuProArgAlaSerProGlyMetTrpCysGlnSerAsnSerArgThr 
CAGCATCATTGACATTACCGCGGGCCTCTCCGGGGATGTGGTSTCAATCGAATTCAAGAA 

3100 

HisTrpLysHisProLysArg***AlaSerCysHisIle***lysValLeu***AlaGly 
ThrGlySerIleProLysAspLysLeuAlaAlaIlePheGl--LysPheTyrArgLeuAsp 
LeuGluAlaSerGlnLysIleSer"**LeuProTyrLeuIysSerSerIleGlyTrpThr 
CACTGGAAGCATCCCAAAAGATAAGCTAGCTGCCATATTTGAAAAGTTCTATAGGCTGGA 

• * • * " • 

GlnPheSerPhePheArgTyrGlyTrpArgGlyThrTrpIleSlyAspCysLysArgAsn 
AsnSerArgSerSerAspThrGlyGlyAlaGlyLeuGlyLeuAlalleAlaLysGluIle 
IleLeuValLeuProIleArgValAlaArgAspLeuAspTrpArgLeuGlnLysLysLeu 
CAATTCTCGTTCTTCCGATACGGGTGGCGCGGGACTTGGATTGGCGATTGCAAAAGAAAT 
3200 .... 
Ty rCysSerAlaTrpArgAlaAspLeuArgGlyLy sLeu * * * * » *LeuTy r AspVal * * * 
IleValGlnHisGlyGlyGlnlleTyrAlaGluSerTyrAscAsnTyrThrThrPheArg 
LeuPheSerMetGluGlyArgPheThrArgLysAlaMetlleThrlleArgArgLeuGly 
TATTGTTCAGCATGGAGGGCAGATTTACGCGGAAAGCTATGATAACTATACGACGTTTAG 

3300 
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GlyArgAlaSerSerAspAlaArgLeuGly * • * * * *LysGluValLeuArgAspVal7yr 
ValGluLeuProAlaMetProAspLeuValAspLysArgArcSer***GluMetTyrIle 

«**SerPheGlnArgCysGlnThrTrpi.euIleLysGlyGlyProLysArgCyslle**» 
GGTAGAGCTTCCAGCGATGCCAGACTTGG7TGATAAAAGGAGG7CCTAAGAGATGTATAT 

AsnPheLeuGlyLysSerGlnGly7yrLeuTyrPhePheLeuGlyAsn***GlnPheAsn 
IlePhe***GluAsnLeuLysValIlePheThrPheSer***GluIleAsnAsnLeuIle 
PhePheArgLysIleSerArgLeuSerLeuLeuPheLeuArgLysLeuThrIle***Tyr 
AATTTTT7AGGAAAATCTCAAGGTTATCTTTACTTTTTCTTAGGAAATTAACAATTTAAT 

3400 

IleLysLysArgLeuValLeuThrArg***Thr***TyrArcLysAsnGluProPheSer 
LeuArgAsnGlySerPheLeuHisGlyArgLeuAsnThrValArgThrSerArgPheArg 
**»Glu7hrAlaArgSerTyrThrValAspLeuIlePro***GluArgAlaValPheVal 
A77AAGAAACGGC7CG77C77ACACGG7AGAC77AA7ACCG7AAGAACGAGCCG7TT7CG 

PhePheArgGluArgPheAspLysIleThrlleGlylleProValLeuPheGlyAlaPhe 
SerSerGluLysAspLeuThrArgLeuProLeuAlaSerProPheTyrLeuValProPhe 
LeuGlnArgLysIle***GlnAspTyrHis7rpHisProArgPheIleTrpCysLeuSer 
TTCTTCAGAGAAAGA7TTGACAAGATTACCAT7GGCATCCCCG7TTTA7T7GGTGCC7TT 

3500 .... 
HisArgLysGlyTrpSer***Leu***IleThrSerAlaLeuLeuPheMetAspValSer 
7hrGluArgValGlyLeuAsnTyrGlu***HisArgHisTyrCysLeuTrpMet***Ala 
GlnLysGlyLeuValLeuIleMetAsnAsnIleGlyIleT3irVal7yrGlyCysGluGln 
CACAGAAAGGG77GG7C77AATTATGAA7AACA7CGGCATTAC7GTT7A7GGATGTGAGC 

3600 
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ArgMetArgGlnKetHisSerMetLeuPheArgLeuAlaLeuAlaLeuTrpGlnAxg"* 
Gly-**GlyArgCysIleProCysSerPheAlaSerLeuTrpArgTyrGlyAsnAspAsn 
AspGluAlaAspAlaPheHisAlaLeuSerProAxgPheGiyValMetAlaThrllelle 
AGGATGAGGCAGATGCATTCCATGC7CTTTCGCCTCGCTTTGSCGTTATGGCAACGATAA 

LeuThrProThrCysArgAsnProThrProAsnProAxgLeuSerlleAsnValSerVal 
»**ArgGlnArgValGlyIleGlnArgGlnIleArgAlaPteGlnSerMetTyrGlnCys 
AsnAlaAsnValSerGluSerAsnAlaLysSerAlaProFieAsnGlnCysIleSerVal 
TTAACGCCAACGTGTCGGAATCCAACGCCAAATCCGCGCCT7TCAATCAATGTATCAGTG 

3700 

TrpAspIleAsnGlnArgPheProProLeuPhePheLeuAxg* * *ArgGluProVal* * * 
GlyThr***IleArgAspPheArgLeuTyrSerSerCysAl2GluGluSerArgCysGlu 
GlyHisLysSerGluIleSerAlaSerlleLeuLeuAlalsuLysArgAlaGlyValLys 
TGGGACATAAATCAGAGATTTCCGCCTCTATTCTTCTTGCGCTGAAGAGAGCCGGTGTGA 

. . • • 

AsnIlePheLeuProGluAlaSerAlaAlaIleIle***IleGlnLeuLeuLeuArgGlu 

IleTyrPheTyrProLysHisArgLeuGlnSerTyrArgTyrAsnCysCys***GluAsn 
TyrlleSerThrArgSerlleGlyCysAsnHisIleAspTlirThrAlaAlaLysArgMet 
AATATATTTCTACCCGAAGCATCGGCTGCAATCATATAGATACAACTGCTGCTAAGAGAA 

3800 .... 
TrpAlaSerLeuSerThrMetTrpArgThrArgArglleAlaLeuProIlelleLeu*** 
GlyHisHisCysArgGlnCysGlyValLeuAlaGly***ArgCysArgLeuTyrTyrAsp 
GlylleThrValAspAsnValAlaTyrSerProAspSerValAlaAspTyrThrMetMet 
TGGGCATCACTGTCGACAATGTGGCGTACTCGCCGGATAGCGT7GCCGATTATACTATGA 

3900 
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Cy s * * *PheLeuTrpGln7yrAla7hr * * * AsnArgLeuCysAlaLeuTrpLy sAsnMet 
AlaAsnSerTyrGIySerThrGlnArgLysIleAspCysAlaLeuCysGlyLysThr*** 
LeuIleLeuMetAlaValArgAsnValLysSerlleValArgSerValGluLysHisAsp 
7GC7AA77C77A7GGCAG7ACGCAACG7AAAA7CGA77G7GCGC7C7G7GGAAAAACA7G 

• ••••* 
HeSerGlyTrpThrAlaThrValAlaArgTyrSerAlaThr***GlnLeuValTrpTrp 

PheGlnValGlyGlnArgProTrpGlnGlyThrGlnArgfiisAspSerTrpCysGlyGly 
- PheArgLeuAspSerAspArgGlyLysValLeuSerAspKetThrValGlyValValGly 
ATTTCAGGTTGGACAGCGACCGTGGCAAGGTACTCAGCGACATGACAGTTGGTGTGGTGG 

4000 

GluArgAlaArg***AlaLysArgLeuLeuSerGlyCysGluAspLeuAspValLysCys 
AsnGlyProAspArgGlnSerGlyTyr***AlaAlaAlaArcIleTrpMet***SerVal 
ThrGlyGlnlleGiyLysAlaVallleGluArgLeuArcGlyPheGlyCysLysValLeu 
GAACGGGCCAGA7AGGCAAAGCGG77A77GAGCGGC7GCGAGSA7T7GGA7G7AAAG7G7 

. • ... 

7rpLeuIleValAlaAlaGluVal***Arg***7hrMet7yrArgLeuMetSerCysCys 
GlyLeu***SerGlnProLys7yrArgGlyLysLeuCys7hrVal******ValAlaAla 
Ala7yrSerArgSerArgSerIleGluValAsn7yrValProPheAspGluLeuLeuGln 
7GGC77A7AG7CGCAGCCGAAG7A7AGAGG7AAAC7A7G7ACCG777GA7GAG77GC7GC 
4100 .... 
LysIleAlaIleSerLeuArgPheMetCysArgSerIleArgIleArg7hrIleLeuSer 
Lys***Arg7yrArg7yrAlaSerCysAlaAlaGln7yrGIy7yrAlaLeu7yr7yrGln 
AsnSerAspIleVal7hrLeuHisValProLeuAsn7hrAsp7hrHis7yrIleIleSer 
AAAA7AGCGA7A7CG77ACGC77CA7G7GCCGC7CAA7ACGGA7ACGCAC7A7A77A7CA 

4200 
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AlaThrAsnLysTyrArgGlu***SerLysGluHisPheLeu£erIleLeuGlyAlaVal 
ProArgThrAsnThrGluAsriGluAlaArgSerlleSerTyrGlnTyrTzpAlaArgSer 
HisGluGlnlleGlnArgMetLysGlnGlyAlaPheLeuIleAsnThrGlyArgGlyPro 
GCCACGAACAAATACAGAGAATGAAGCAAGGAGCATTTCTTATCAATACTGGGCGCGGTC 

* 

HisLeu***IleProMetSerTrpLeuLysHis***LysThrGlyAsnTrpAlaVaiPro 
ThrCysArgTyrLeu***ValGly***SerIleArgLysAxrGluThrGlyArgCysArg 
LeuValAspThrTyrGluLeuValLysAlaLeuGluAsnGlyLysLeuGlyGlyAlaAla 
CACTTGTAGATACCTATGAGTTGGTTAAAGCATTAGAAAACGC-GAAACTGGGCGGTGCCG 

4300 

HisTrpMetTyrTrpLysGluArgLysSerPheSerThrLexiIleAlaProLysAsnGln 
IleGlyCysIleGlyArgArgGlyArgValPheLeuLeu** w leuHisProLysThrAsn 
LeuAspValLeuGluGlyGluGluGluPhePheTyrSerAspCysThrGlnLysProIle 
CATTGGATGTATTGGAAGGAGAGGAAGAGTTTTTCTACTCTG.-.ITGCACCCAAAAACCAA 

* • • » • 

LeuIleIleAsnPheTyrLeuAsnPheLysGluCysLeuTbr-*****SerHisArgIle 
* * * * * *SerI lePheThr * * *ThrSerLy sAsnAla * * * Ar gAspAsnHisThrAlaTy r 
AspAsnGlnPheLeuLeuLysLeuGlnArgMetProAsnValllelleThrProHisThr 
TTGATAATCAATTTTTACTTAAACTTCAAAGAATGCCTAACG7GATAATCACACCGCATA 
4400 .... 
ArgProIlelleProSerLysArgCysVallleProLeuLysLysProLeuLysThrVal 
GlyLeuLeuTyrArgAlaSerValAla***TyrArg***LysAsnHis***LysLeuPhe 
AlaTyrTyrThrGluGlnAlaLeuArgAspThrValGluLysThrlleLysAsnCysLeu 
CGGCCTATTATACCGAGCAAGCGTTGCGTGATACCGTTGAAAAAACCATTAAAAACTGTT 

4500 
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TrpIleLeuLysGlyAspArgSerMetAsnArglleLysValAlalleLeuPheGlyGly 
GlyPhe***LysGluThrGlyAla***IleGlu***LysLeuGlnTyrCysLeuGlyVal 
AspPheGluArgArgGlnGluHisGlu * * * AsnLysSerCysAsnThrValTrpGlyLeu 
TGGATTTTGAAAGGAGACAGGAGCATGAATAGAATAAAAGTTGCAATACTGTTTGGGGGT 

CysSerGluGluHisAspValSerValLysSerAlalleGluIleAlaAlaAsnlleAsn 
AlaGlnArgSerMetThrTyrArg***AsnLeuGln***Arg***ProLeuThrLeuIle 
LeuArgGlyAla* * * ArglleGlyLys IleCy sAsnArgAspSer Arg* * *His ****** 
TGCTCAGAGGAGCATGACGTATCGGTAAAATCTGCAATAGAGATAGCCGCTAACATTAAT 

4 600 . . 

LysGluLysTyrGluProLeuTyrlleGlylleThrLysSerGlyValTrpLysMetCys 
LysLysAsnThrSerArgTyrThrLeuGluLeuArcAsnLeuValTyrGlyLysCysAla 
ArgLysIleArgAlaVallleHisTrpAsnTyrGluIleTrpCysMetGluAsnVaiArg 
AAAGAAAAATACGAGCCGTTATACATTGGAATTACGAAATCTGGTGTATGGAAAATGTGC 
• • * * . • 

GluLysProCysAlaGluTrpGluAsnAspAsnCysTyrSerAlaValLeuSerProAsp 
LysAsnLeuAlaArgAsnGlyLysThrThrlleAlalleGlnLeuTyrSerArgArglle 
LysThrLeuArgGlyMetGlyLysArgGlnLeuLeuPheSerCysThrLeuAlaGly*** 
GAAAAACCTTGCGCGGAATGGGAAAACGACAATTGCTATTCAGCTGTACTCTCGCCGGAT 
4700 .... 
LysLysMetHisGlyLeuLeuValLysLysAsnHisGluTyrGluIleAsnHisValAsp 
LysLysCysThrAspTyrLeuLeuLysArgThrMetAsnMetLysSerThrMetLeuMet 
LysAsnAlaArgIleThrCys***LysGluPro***Ile***AsnGlnProCys***Cys 
AAAAAAATGCACGGATTACTTGTTAAAAAGAACCATGAATATGAAATCAACCATGTTGAT 

4800 
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ValAlaPheSerAlaLeuHisGlyLysSerGlyGluAspGl%-SerIleGlnGlyLeuPhe 
***HisPheGlnLeuCysMetAlaSerGInValLysMetAs??roTyrLysValCysLeu 
SerIlePheSerPheAlaTrpGlnValArg***ArgTrpIleHisThrArgSerVal*** 
GTAGCATTTTCAGCTTTGCATGGCAAGTCAGGTGAAGATGGATCCATACAAGGTCTGTTT 

GluLeuSerGlylleProPheValGlyCysAspIleGlnSerSerAlalleCysMetAsp 
AsnCysProValSerLeuLeu***AlaAlaIlePheLysAlaGlnGlnPheValTrpThr 
IleValArgTyrProPheCysArgLeuArgTyrSerLysLeuSerAsnLeuTyrGlyGln 
GAATTGTCCGGTATCCCT7TTGTAGGCTGCGATATTCAAAGCTCAGCAATTTGTATGGAC 

4900 . 

LysSerLeuThrTyrlleValAlaLysAsnAlaGlylleAlaThrProAlaPheTrpVal 
AsnArg***HisThrSerLeuArgLysMetLeuGly***LeuLeuProProPheGlyLeu 
IleValAspIleEisArgCysGluLysCysTrpAspSerTyrSerArgLeuLeuGlyTyr 
AAATCGTTGACATACATCGTTGCGAAAAATGCTGGGATAGCTACTCCCGCCTTTTGGGTT 

• •«•*• 
IleAsnLysAspAspArgProValAlaAlaThrPheThrTyrrroValPheValLysPro 
LeuIleLysMetlleGlyArgTrpGlnLeuArgLeuProIleLeuPheLeuLeuSerArg 
****** Ar gik***** A l a QlyQlyS erT y r ValTyrLeuSerCysPheCys***AlaGly 
ATTAATAAAGATGATAGGCCGGTGGCAGCTACGTTTACCTATCCTGTTTTTGTTAAGCCG 

5000 . 

AlaArgSerGlySerSerPheGlyValLysLysValAsnSerAlaAspGluLeuAspTyr 
ArgValGlnAlaHisProSerVal***LysLysSerIleAlaArgThrAsnTrpThrThr 
AlaPheArgLeuIleLeuArgCysGluLysSerGln***ArgGlyArgIleGlyLeuArg 
GCGCGTTCAGGCTCATCCTTCGGTGTGAAAAAAGTCAATAGCGCGGACGAATTGGACTAC 

5100 
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AlalleGluSerAlaArgGlnTyrAspSerLysIleLeuIleGluGlnAlaValSerGly 
GlnLeuAsnArgGlnAspAsnMetThrAlaLysSer* * *LeuSerArgLeuPheArgAla 
Asn***IleGlyLysThrIle***GlnGInAsnLeuAsn***AlaGlyCysPheGlyLeu 
GCAATTGAATCGGCAAGACAATATGACAGCAAAATCTTAATTGAGCAGGCTGTTTCGGGC 

CysGluValGlyCysAlaValLeuGlyAsnSerAlaAlaLeuValValGlyGluValAsp 
ValArgSerValValArgTyrTrpGluThrValProArg***LeuLeuAlaArgTrpThr 
***GlyArgLeuCysGlyIleGlyLysGlnCysArgValSerCysTrpArgGlyGlyPro 
TGTGAGGTCGGTTGTGCGGTATTGGGAAACAGTGCCGCG7TAG7TGTTGGCGAGGTGGAC 

5200 

GlnlleArgLeuGlnTyrGlyllePheArglleHisGlnGluValGluProGluLysGiy 
LysSerGlyCysSerThrGluSerPheValPhelleArgLysSerSerArgLysLysAla 
AsnGlnAlaAlaValArgAsnLeuSerTyrSerSerGlySerArgAlaGlyLysArgLeu 
CAAATCAGGCTGCAGTACGGAATCTTTCGTATTCATCAGGAAGTCGAGCCGGAAAAAGGC 
* • • « » • 

SerGluAsnAlaVallleThrValProAlaAspLeuSerAlaGluGluArgGlyArglle 
LeuLysThrGlnLeu***ProPheProGlnThrPheGlnGlnArgSerGluAspGlyTyr 
* * *Ly s Ar gSerTyrAsnArgSer ArgAr gProPheSerArgGlyAlaArgThrAspThr 
TCTGAAAACGCAGTTATAACCGTTCCCGCAGACCTTTCAGCAGAGGAGCGAGGACGGATA 
5300 . 

GlnGluThrAlaLysLysIleTyrLysAlaLeuGlyCysArgGlyLeuAlaArgValAsp 
ArgLysArgGlnLysLysTyrIleLysArgSerAlaValGluVal***ProValTrpIle 
GlyAsnGlyLysLysAsnIle***SerAlaArgLeu***ArgSerSerProCysGlyTyr 
CAGGAAACGGCAAAAAAAATATATAAAGCGCTCGGCTGTAGAGGTCTAGCCCGTGTGGAT 

5400 
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MetPheLeuGlnAspAsnGlyArglleValLeuAsnGluVaiAsnThrLeuProGlyPhe 
CysPheTyrLysIleThrAlaAlaLeuTyr«**ThrLysSerIleLeuCysProValSer 
ValPheThrArg***ArgProHisCysThrGluAxgSerGlnTyrSerAlaArgPheHis 
ATGTTTTTACAAGATAACGGCCGCATTGTACTGAACGAAG7CAATACTCTGCCCGGTTTC 

ThrSerTyrSerArgTyrProArgMetMetAlaAlaAlaGlylleAlaLeuProGluLeu 
ArgHisThrValValIleProVal***TrpProLeuGlnValLeuHisPheProAsn*** 
VallleGlnSerLeuSerProTyrAspGlyArgCysArgTyrCysThrSerArgThrAsp 
ACGTCATACAGTCGTTATCCCCGTATGATGGCCGCTGCAGGTATTGCACTTCCCGAACTG 

5500 

IleAspArgLeuIleValLeuAlaLeuLysGly******AlaIrpLys«-*AspLeuLeu 
LeuThrAla***SerTyr***Arg"**ArgGlyAspLysHisGlyAsnArgIleTyrPhe 
«**ProLeuAspArgIleSerValLysGlyValIleSery.stGluIleGlyPheThrPhe 
ATTGACCGCTTGATCG7ATTAGCGTTAAAGGGGTGATAAGCATGGAAATAGGATTTACTT 

Phe***MetLys**"TyrThrValPheValGlyThrLeuAsnMetProLeuGlyIlelle 
PheArg***AsnSerThrArgCysSerLeuGlyArg***IleCysHisLeuGly***Phe 
LeuAspGluIleValHisGlyValArgTrpAspAlaLysTyrAlaThrTrpAspAsnPhe 
TTTTAGATGAAATAGTACACGGTGTTCGTTGGGACGCTAAATATGCCACTTGGGATAATT 

5 600 

SerProGluAsnArgLeuThrValMetLy s * * * I leAlaLeu* * *GlyHisThrSerTrp 
HisArgLysThrGly»**ArgLeu***SerLysSerHisCysArgAspIleArgValGly 
ThrGlyLysProValAspGlyTyrGluValAsnAxglleValGlyThrTyrGluLeuAla 
TCACCGGAAAACCGGTTGACGGTTATGAAGTAAATCGCATTGTAGGGACATACGAGTTGG 

5700 
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LeuAsnArgPhe***ArgGlnLysAsnTrpLeuLeuProLysGlyThrAspCysPheTyr 
***IleAlaPheGluGlyLysArgThrGlyCysTyrProArgValArgIleAlaSerMet 
GluSerLeuLeuLysAlaLysGluLeuAlaAlaThrGlnGlyTyrGlyLeuLeuLeuTrp 
CTGAATCGCTTTTGAAGGCAAAAGAACTGGCTGCTACCCAAG-3GTACGGATTGCTTCTAT 
• • • • * • 

GlyThrValThrValLeuSerValLeu***ThrValLeuCysAsnGlyLeuHisSerArg 
GlyArgLeuProSer***AlaCysCysLysLeuPheTyrAlaMetGlyCysThrAlaGly 
AspGlyTyrArgProLysArgAlaValAsnCysPheMetGlnTrpAlaAlaGlnProGlu 
GGGACGGTTACCGTCCTAAGCGTGCTGTAAACTGTTTTATGCAATGGGCTGCACAGCCGG 

5800 

LysIleThr***GlnArgLysValIleIleProIleLeuThrGluLeuArg***PheGln 
Lys***ProAspLysGlyLysLeuLeuSerGlnTyr***PrcAsn***AspAspPheLys 
AsnAsnLeuThrLysGluSerTyrTyrProAsnlleAspArgThrGluMetlleSerLys 
AAAATAACCTGACAAAGGAAAGTTATTATCCCAATATTGACCGAACTGAGATGATTTCAA 

LysAspThrTrpLeuGlnAsnGlnAlalleAlaAlaAlaValrroLeuIleLeuArgPhe 
ArgIleArgGlyPheLysIleLysPro***ProArgGlnCysHis***SerTyrAlaLeu 
GlyTyrValAlaSerLysSerSerHisSerArgGlySerAlalleAspLeuThrLeuTyr 
AAGGATACGTGGCTTCAAAATCAAGCCATAGCCGCGGCAGTGCCATTGATCTTACGCTTT 

5900 . . . . 

IleAsp***ThrArgValSerLeuTyrGlnTrpGlyAlaAspLeuIleLeuTrpMet:Asn 
SerIleArgHisGly***AlaCysThrAsnGlyGluProIle***PheTyrGly***Thr 
ArgLeuAspThrGlyGluLeuValProMetGlySerArgPbeAspPheMetAspGluArg 
ATCGATTAGACACGGGTGAGCTTGTACCAATGGGGAGCCGAT7TGATTTTATGGATGAAC 

6000 
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AlaLeuIleMetArcGlnMetGluTyrHisAlaMetLysArclysIleAlaAspValCys 
LeuSerSerCys61yLysTrpAsnIleMetGln***SerAl£LysSerGlnThrPheAla 
SerHisHisAlaAlaAsnGlylleSerCysAsnGluAlaGLnAsnArgArgArgLeuArg 
GCTCTCATCATGCGGCAAATGGAATATCATGCAATGAAGCGCAAAATCGCAGACGTTTGC 

• • • • • • 
AlaProSerTrpLysThrValGlyLeuLysHisIleAlaSerAsnGlyGlyThrMetTyr 

LeuHisHisGlyLysGlnTrpVal***SerIle***ProArgMetValAlaLeuCysIle 
SerlleMetGluAsnSerGlyPheGluAlaTyrSerLeuGiuTrpTrpHisTyrValLeu 
GCTCCATCATGGAAAACAGTGGGTTTGAAGCATATAGCCTCGAATGGTGGCACTATGTAT 

6100 

***GluThrAsnHisThrProIleAlaIleLeuIleSerPrcIeuAsnLysLeuLeuThr 
LysArgArgThr IleProGln * * *LeuPhe * * *PheProArr* * * I leAsnPhe * * *P ro 
ArgAspGluProTyrProAsnSerTyrPheAspPheProValLys***ThrPheAsnArg 
TAAGAGACGAACCATACCCCAATAGCTATTTTGATTTCCCC3TTAAATAAACTTTTAACC 

• • • • • • 
ValAlaArgThrAsnTyrIleSer***LeuPheArcGlnGlurhrArgArgMet***Leu 

LeuHisGlyGlnThrIle***AlaAsnSerPheGlyArgLysProAspValCysAsnTrp 
CysThrAspLysLeuTyrLysLeuThrLeuSerAlaGlyAJsnProThrTyrValThrGly 
GTTGCACGGACAAACTATATAAGCTAACTCTTTCGGCAGGAAACCCGACGTATGTAACTG 
6200 • 

ValLeuArgGluPheIleTyrSerArg***Tyr***ArgCysLysAlaGluArgTyrCys 
PheLeuGlyAsnLeuTyrlleValAspSerlleGluAspValArgGlnSerAspIleAla 
Ser***GlyIleTyrIle******IleValLeuLysMet***GlyArgAlaIleLeuArg 
GTTCTTAGGGAATTTATATATAGTAGATAGTATTGAAGATGIAAGGCAGAGCGATATTGC 

6300 
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GlyHisTyrLeuArgAlaLeuArgGlnAspSerLeuIleIleArgLeuIleAla***Arg 
ValIleIleCysValArgCysGlyLysIleAla*********Asp***SerHisArgGly 
SerLeuSerAlaCysAlaAlaAlaAxg***ProAspAsnLys7hrAspArgIleGluGly 
GGTCATTATCTGCGTGCGCTGCGGCAAGATAGCCTGATAATAAGACTGATCGCATAGAGG 
• • • • • • 

GlyGlylleSerHisArgProLeuSerThrGlySerSerAlaSerLeuAsnSerAlaTrp 
ValValPheHisThrAlaHisCysGlnGlnAlaValGlnPrcArg***IleGlnHisGly 
TrpTyrPheThrProProIleValAsnArgGlnPheSerLeuValLysPheSerMetGly 
GGTGGTATTTCACACCGCCCATTGTCAACAGGCAGTTCAGCCTCGTTAAATTCAGCATGG 

6400 

ValSerLeuMetLysIleHisLeuHisTrp************IleGln***GlyGluIle 
TyrHisLeu***LysPheIleTyrIleGlyAspAsnSerLysSerSerArgAlaLys*** 
IleThrTyrGluAsnSerSerThrLeuValllelleValA.sriProValGlyArgAsnAsn 
GTATCACTTATGAAAA77CATCTACATTGGTGATAA7AGTAA-.7CCAG7AGGGCGAAA7A 

IleAspCysAsnLeuArgGlyLysThrAlaGlnSerGlnThrArgLeuCysArgLeuArg 
LeuThrValIleTyrGlyAlaLysArgHisAsnLeuLysArgAspCysAlaVal***Gly 
***Leu***Phe7hrGlyGlnAsnGly7hrIleSerAsnGluIleValProPheLysGly 
A77GACTGTAA7T7ACGGGGCAAAACGGCACAA7C7CAAACGAGA77G7GCCG777AAGG 

6500 .... 
GlyArgPhe***LysTyrPheIleLeuProThrIle***LeuArgArgArgLeuLysMet 
GluAspSerArgAsnl leSer7yrPheGlnLeu7y rSer* "GlyGiyAsp* * *Lys * ** 
LysIleLeuGluIlePheHisThrSerAsnTyrlleValLysGluGluThrGluAsnGlu 
GGAAGA7TC7AGAAA7A777CA7ACT7CCAAC7A7A7AG77AAGGAGGAGAC7GAAAA7G 

6600 
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LysLysLeuPhePheLeuLeuLeuLeuLeuPheLeuIleTyrLeuGlyTyrAspTyrVal 
ArgSerCysPhePheTyrCysTyrCysTyrSer * * *TyrThr * * * ValMetThrThrLeu 
GluValValPhePheIleValIleVaiIleLeuAsnIleLeuArgLeu***LeuArg,*** 
AAGAAGTTGTTTTTTTTATTGTTATTGTTATTCTTAATATACTTAGGTTATGACTACGTT 
• • • • • • 

AsnGluAlaLeuPheSerGlnGluLysValGluPheGlnAsnTyrAspGlnAsnProLys 
MetLysHisCysPheLeuArgLysLysSerAsnPheLysIleMetlleLysIleProLys 
***SerThrValPheSerGlyLysSerArgIleSerLysLeu***SerLysSerGlnArg 
AATGAAGCACTGTTTTCTCAGGAAAAAGTCGAATTTCAAAA7TATGATCAAAATCCCAAA 

6700 . 

GluHisLeuGluAsnSerGlyThrSerGluAsnThrGlnGluLysThrlleThrGluGlu 
AsnIle***LysIleValGlyLeuLeuLysIleProLysArcLysGlnLeuGlnLysAsn 
ThrPheArgLys***TrpAspPhe***LysTyrProArgGluAsnAsnTyrArgArgThr 
GAACATTTAGAAAATAGTGGGACTTCTGAAAATACCCAAGAGAAAACAATTACAGAAGAA 
» » • * • • 

GInValTyrGlnGlyAsnLeuLeuLeuIleAsnSerLysTyrProValArgGlnGluVal 
ArgPheIleLysGluIleCysTyr***SerIleValAsnIleLeuPheAlaLysLysCys 
GlyLeuSerArgLysSerAlaIleAsnGln******IleSerCysSerProArgSerVal 
CAGGTTTATCAAGGAAATCTGCTATTAATCAATAGTAAATATCCTGTTCGCCAAGAAGTG 
6800 . 

* * *SerGlnIleSer * * * IleTy rLeuAsnMetThrAsn* * * * * *Met AspThrGlyCy s 
GluValArgTyrArgGluPheIle***Thr***ArgIleAsnLysTrpIleArgValAla 
LysSerAspIleValAsnLeuSerLysHisAspGluLeuIleAsnGlyTyrGlyLeuLeu 
TGAAGTCAGATATCGTGAATTTATCTAAACATGACGAATTAATAAATGGATACGGGTTGC 

6900 
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LeuIleValIlePheIleCysGlnLysLys***HisLysAsr.?heGlnArgTrpSerMet 
******** *Ty r LeuTy rVa lLysArgAsnSerThrLysI IsPheArgAspGlyGIn * * * 
AspSerAsnlleTyrMetSerLysGluIleAlaGlnLysFheSerGluMetValAsnAsp 
TTGATAGTAATATTTATATGTCAAAAGAAATAGCACAAAAATITTCAGAGATGGTCAATG 

• • • • » • 
MetLeu***ArgValAlaLeuVaIIleLeuLeuLeuIleVa lAlalleGluThrLeuMet 

CysCysLysGlyTrpArg***SerPheTyrTyr******Tr?LeuSerArgLeu****** 
AlaValLysGlyGiyValSerHisPhellelleAsnSerGlyTyrArgAspPheAspGlu 
ATGCTGTAAAGGGTGGCGTTAGTCATTTTATTATTAATAGTG^CTATCGAGACTTTGATG 

7000 . 

SerLysValCysPheThrLysLysTrpGlyLeuSerMetPrcTyrGlnGlnVallleVal 
AlaLysCysAlaLeuProArgAsnGlyGly***ValCysLe--ThrSerArgLeu****** 
GlnSerValLeuTyrGlnGluKetGlyAlaGluTyrAlaleuProAlaGlyTyrSerGlu 
AGCAAAGTGTGCTTTACCAAGAAATGGGGGCTGAGTATGCCriACCAGCAGGTTATAGTG 

• * • • » » 
SerIleIleGlnValTyrHis***Met***AspGlnAla***.-.rgLysTrpAsnGluPro 

Ala***PheArgPheIleThrArgCysArgIleLysLeuAspGluAsnGlyThrSerPro 
HisAsnSerGlyLeuSerLeuAspValGlySerSerLeuTirLysMetGluArgAlaPro 
AGCATAATTCAGGTTTATCACTAGATGTAGGATCAAGCTTGA3GAAAATGGAACGAGCCC 

7100 .... 
LeuLysGluSerGly***LysLysMetLeuGlyAsnThrGlySerPheTyrValIleGln 
***ArgLysValAspArgArgLysCysLeuGluIleArgValHisPheThrLeuSerArg 
GluGlyLysTrpIleGluGluAsnAlaTrpLysTyrGlyPhelleLeuArgTyrProGlu 
CTGAAGGAAAGTGGATAGAAGAAAATGCTTGGAAATACGGGT7CATTTTACGTTATCCAG 

7200 
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ArgThrLysGlnSer* * *GlnGluPhe 
GlyGlnAsnArgVa lAsnArgAsnSer 
AspLysThrGluLeuThrGlylleGln 
AGGACAAAACAGAGTTAACAGGAATTC 

7227 
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declaration, ^c£oer ©f Attorney anir "Petition 

Page 1 or > 

WE (I) the undersigned inventor(s), hereby declare(s) chat: 

My residence, post office address and citizenship are as stated below next to my name, 

We (I) believe that we are (I am) the original, first, and joint (sole) inventor(s) of the subject matter which 
is chime6 and for which a patent is sought on the invention entitled 

POLYPEPTIDES TMPT.Tr* ATPIO TT3 THP FlYPPF^T HM OP PPST qgn AKrrP , rp n 

GLYCOPEFTIDES, IN PARTICULAR IN GRAM- POSITIVE BACTERIA. NUCLEOTIDE 
SEQUENCE CODING FOR THESE POLYPEPTIDES AND USE FOR DIAGNOSIS. 

the specification of which 

EJ is attached hereto. 

□ was filed on as 

Application Serial No. 

and amended on 

^ was filed as PCT international application 

Number PCT/FR /00S5S 

on October gg f Igg] 

and was amended under PCT Article 19 

on (if applicable;. 

We (I) hereby state that we (I) have reviewed and understand the contents of the above-identified 
specification, including the claims, as amended by any amendment referred to above. 

We (I) acknowledge the duty to disclose information material to the examination of this application in 
accordance with Secrion 1.56(a) of Tide 37 Code of Federal Regulations. 

We (I) hereby claim foreign priority benefits under Section 119 of Title 35 United States Code, of any 
foreign application^ ) for patent or inventor's certificate listed below and have also identified below any 
foreign application for patent or inventor's certificate having a filing date before that of the application on 
which priority is claimed: 

Priority 

Day/Month /Year Claimed 
31/10/1990 8 Yes □ No 

O Yes □ No 

; □ Yes O No 

□ Yes O No 



COPY 



Application No. Country 
3013579 FRANCE 



1/90 
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Declaration 



We (I) hereby claim the benefit under Section 120 of Titie 35 United States Code, of any United States 
application(s) listed below and, insofar as the subject matter of each of the claims of this application is not 
disclosed in the prior United States application in the manner provided by the first paragraph of Section 112 
of Title 35 United States Code, We (I) acknowledge the duty to disclose material information as defined in 
Section 1.56(a) of Title 37 Code of Federal Regulations, which occurred between the filing date of the prior 
application and the national or PCT international filing date of this application: 

Status (pending, patented, 
Application Serial No. Filing Date abandoned) 

PCT/FR 91/QQB55 29/10/1991 pending 



And we (I) hereby appoint: Norman F. Obion, Registration Number 24,618; Marvin J. Spivak, Registra- 
tion Number 24,913; C. Irvin McClelland, Registration Number 21,124; Gregory J. Maier, Registration 
Number 25,599; Arthur I. Neustadt, Registration Number 24,854; Robert C. Miller, Registration Number 
25,357; Richard D. Kelly, Registration Number 27,757; James D. Hamilton, Registration Number 28,421; 
Eckhard H. Kuesters, Registration Number 28,870; Robert T. Pous, Registration Number 29.099; Charles 
L. Gholz, Registration Number 26,395; Vincent J. Sunderdick, Registration Number 29,004; William E. 
Beaumont, Registration Number 30,996; Steven B, Kelber, Registration Number 30,073; Stuart D. 
Dwork, Registration Number 31,103; Robert F. Gnuse, Registration Number 27,295; jean-Paul Lavallcye, 
Registration Number 31*451; William B. Walker, Registration Number 22,498; and John H.O. Clarke, " 
Registration Number 17,373, our (my) attorneys, with full powers of substitution and revocation, to prose- 
cute this application and to transact all business in the Patent Office connected therewith; and we (I) 
hereby request that all correspondence regarding this application be sent to the firm of OBLON, SPIVAK, 
McClelland, MAIER & NEUDSTADT, P.C., whose Post Office Address is : Fourth Floor, 1755 Jefferson 
Davis Highway, Arlington, Virginia 22202. 

We (I) declare that all statements made herein of our (my) own knowledge are true and that all state- 
ments made on information and belief are believed to be true; and further that these statements were made 
with the knowledge that willful false statements and the like so made are punishable by fine or imprison- 
ment, or both, under Section 1001 of Title 18 of the United States Code and that such willful false state- 
ments may jeopardize the validity of the application or any patent issued thereon. 

ARTHUR Michel Residence: Q, rue du p Q ubourg St Mar tin 

NAME OF FIRST SOLE INVENTOR 75010 PARIS (France) 




Citizen of: SWITZERLAND- 



Signature of Inventor 



Date 



Post Office Address: — ^ — rao du Faubourg St- 
75010 PARIS (France) Ma rtin 
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DUKTA- MAJLEN Svlvie 



NAME OF SECOND 



INVENTOR 




Signature of Inventor 



Da 1 



MOLINAS Catherine 



NAME OF THIRD JOINT INVENTOR 



Signature of Invent 



3r 



Date 



Residence: Li Senti^r cl&* RnQfiigr^Tv 

94260 FRESNES (France) 

Citizenship: FRANCE 



Post Office Address: 

I, Sentier des Rossicrnols 

Srt?fiO FT?gSMES (Franco) 

Residence: 118 f «ie foECaflet 

75 018 PARI S (France) 



Citizenship: FRANCE 

Post Office Address: 118, rue Marcadet 
75018 PftlUS (Franco) 



COURVAT.TN PatrirA 



NAME OF FOURTH JOI JIT INVENTOR 



Signature of Inventor^ 




Date 



NAME OF FIFTH JOINT INVENTOR 



Residence: 13, mo Fmi lo DttolauM 

PARTS (Fr^rp) 

Citizenship: FRANCE 



Post Office Address: 13 f rue Ftoile Duclau 
75015 PARIS (France) 



Residence: 



Signature of Inventor 



Citizenship: 



Post Office Address: 



Date 



1/90 



